(54) Recombinant chimeric proteins and methods of use thereof

(57) A chimeric protein having at least one domain derived from a physiologically active moiety and at least
one domain derived from an extracellular matrix protein is provided. Physiologically active domains are derived
from physiologically active moieties such as bone morphogenic proteins, transforming growth factors, and
dermatan sulfate proteoglycans. The extracellular matrix protein domains are derived from collagen, fibrin,
fibrinogen, laminins and the like. Recombinant DNA constructs, cloning vectors and transformed cells containing
DNA which encodes such chimeric proteins are provided. Methods of using the chimeric proteins, chimeric
DNA constructs, cloning vectors containing chimeric DNA constructs, and cells transformed with the cloning
vectors are also provided. The chimeric proteins can be used as osteogenic agents and/or antiscarring agents.
Description
BACKGROUND
1. Technical Field

Chimeric proteins, and more particularly chimeric proteins having a domain derived from a physiologically active
moiety and a domain derived from an extracellular matrix protein moiety, are provided. Further provided are DNA
constructs encoding such chimeric proteins and methods for preparing such chimeric proteins using recombinant DNA
technology. Methods for healing tissue, including inducing scar reduction and formation of bone and/or cartilage, are
also provided.

2. Description of Related Art 

Chimeric proteins, also known as fusion proteins, are hybrid proteins which combine two or more precursor proteins or
peptides through peptide bonds. Fusion proteins may be produced by recombinant technology, i.e., by fusing part of the
coding sequence of one gene to the coding sequence of another gene. The fused gene may then be used to transform
a suitable organism which then expresses the fusion protein. Such proteins are usually used to test the function of
different domains of a protein molecule or to append a locator or binding peptide onto a protein or peptide of interest.

For example, portions upstream and partially downstream of human, rat or mouse collagen genes have been fused to
other proteins in an attempt to analyze characteristics of transcription. See, e.g., Rossouw, et al., DNA Sequences in the
First Intron of the Human Pro-Alpha 1(I) Collagen Gene Enhance Transcription, Journal of Biological Chemistry, 262 (31),
pp. 15151-15157 (1987). Genomic imprinting effects have been analyzed by fusing the gene encoding human keratin
18 protein with the gene encoding beta-galactosidase (LacZ). See Thorey et al., Parent-Specific Expression of a
Human Keratin 18/beta-galactosidase Fusion Gene in Transgenic Mice, Dev. Dyn. (United States), 195 (2) pp. 100-12
(Oct. 1992). European Patent Application 88302039 describes production and purification of a recombinant protein, e.g.,
collagen, a linker region which may encode a restriction site, and a binding protein for a substrate. The fusion protein is
then contacted with a suitable substrate to which it binds and the protein may then be recovered, e.g., from a column.

Extracellular matrix proteins ("EMPs") are found in spaces around or near cells of multicellular organisms and are
typically fibrous proteins of two functional types: mainly structural, e.g., collagen and elastin, and mainly adhesive, e.g.,
fibronectin and laminin. Collagens are a family of fibrous proteins typically secreted by connective tissue cells. Twenty
distinct collagen chains have been identified which assemble to form a total of about ten different collagen molecules.
A general discussion of collagen is provided by Alberts, et al., The Cell, Garland Publishing, pp. 802-823 (1989),
incorporated herein by reference. Other fibrous or filamentous proteins include Type I IF proteins, e.g., keratins; Type II IF
proteins, e.g., vimentin, desmin and glial fibrillary acidic protein; Type III IF proteins, e.g., neurofilament proteins; and
Type IV IF proteins, e.g., nuclear laminins.

Physiologically active glycoproteins, proteins, peptides and proteoglycans are abundant in living things. Such
glycoproteins, proteins, peptides and proteoglycans are involved in a diverse array of cellular or viral functions which include
initiation or regulation of metabolism, catabolism, reproduction, growth and repair of various life forms. Physiologically
active glycoproteins, proteins, peptides, and proteoglycans include therapeutically active glycoproteins, proteins,
peptides, and proteoglycans such as hormones, growth factors, enzymes, ligands and receptors and fragments thereof.
Therapeutically active substances include glycoproteins, proteins, peptides and proteoglycans which have been used
in medicine and research, e.g., to achieve a beneficial result in relation to disease states, trauma and/or to increase
efficiency of normal cellular functions. Examples of therapeutically active glycoproteins, proteins, peptides and
proteoglycans include cellular regulatory factors such as interleukins, GCSF, erythropoietin, insulin, growth hormone, ACTH,
thyroid hormones, various growth factors, osteogenic or osteoinductive factors, decorin and the like.

Osteogenic agents are any of a family of proteins or peptides that induce formation of bone and/or cartilage.
Osteogenin, bone morphogenic protein ("BMP") or osteoinductive protein are other terms which describe proteins having
bone inducing activity. BMPs are a family of related proteins that trigger the developmental cascade of bone differentiation
by inducing mesenchymal stem cells to grow into a variety of tissues including bone, cartilage, and dentin. The activity
of BMPs is particularly useful for repairing large bone defects which may not heal without clinical intervention.

Osteogenic agents have been isolated from demineralized mammalian bone tissue (see, e.g., U.S. Patent Nos.
4,294,753 and 4,761,471). Substantially pure BMPs have been produced by recombinant DNA techniques (see, e.g.,
U.S. Patent Nos. 5,106,748, 5,187,076, 5,141,905, 5,108,922, 5,166,058, and 5,116,738). U.S. Patent No. 5,168,050
describes the use of a DNA construct having a DNA sequence encoding the precursor portion of BMP-2A ligated to a
DNA sequence encoding BMP-2B for obtaining improved expression of BMP-2B.

Certain methods have been employed for inducing formation of bone and/or cartilage with BMPs. When BMP is 
implanted in viable tissue without a delivery formulation, the BMP resorbs rapidly and does not effectively induce bone 
formation. Therefore, formulations for delivery or implantation of BMPs have been developed. 
The following are examples of attempts to make delivery devices for BMPs. U.S. Patent No. 4,472,840 describes
collagen and BMP conjugates or complexes in the form of microporous sponges to induce the formation of osseous
tissue in animals or humans. U.S. Patent No. 4,975,527 describes enzyme-solubilized collagen as a carrier of bone
morphogenic protein. U.S. Patent No. 4,563,489 describes delivery systems for BMP that are admixtures of
biodegradable organic polymers such as polylactic acid and polyglycolic acid.

U.S. Patent No. 5,106,626 describes administration of osteogenic protein extracted from mammalian bone admixed
with or absorbed on a matrix such as tricalcium phosphate, hydroxyapatite, thermoplastic polymer materials, collagen,
plaster of paris, polylactic acid, polycaprolactic acid, or polyglycolic acid. U.S. Patent Nos. 5,011,691 and 5,250,302
describe methods of purifying osteogenic protein from mammalian bone and combining it with a matrix of porous material
such as collagen, homopolymers or copolymers of glycolic acid and lactic acid, hydroxyapatite, or tricalcium phosphate.

It has been suggested that to prevent rapid resorption of BMP from a site of implantation, osteogenic sequestering
agents may be used in connection with an admixture of osteogenic protein and a porous polymeric matrix. U.S. Patent
No. 5,171,579 describes a composition of an admixture of an osteogenic protein, a porous particulate matrix and an
osteogenic protein sequestering amount of blood clot. PCT WO 93/00050 describes an admixture of an osteogenic
protein, a polymer matrix of poly(lactic acid), poly(glycolic acid), and copolymers of lactic acid and glycolic acid, and
an osteogenic protein-sequestering material which may be alkylcellulose, hyaluronic acid, alginate, poly(ethylene glycol),
polyoxyethylene oxide, carboxyvinyl polymer, poly(vinyl alcohol) or carboxymethylcellulose.

Notwithstanding the research done in the area of drug delivery devices, compositions which deliver a clinically
effective dose of therapeutic agents over a predetermined period of time to precise target sites, and which combine easy
handling for the medical practitioner with manufacturing convenience, are still desirable. Elimination of the above-mentioned
separate purified matrix materials and sequestering agents, and substitution of more effective therapeutically active
compositions, would be advantageous.

SUMMARY 

Chimeric proteins having a domain derived from at least one extracellular matrix protein and a domain derived from 
at least one cellular regulatory factor are provided. Suitable domains derived from cell regulatory factors include oste- 
ogenic domains, domains derived from a transforming growth factor, and domains derived from dermatan sulfate pro- 
teoglycans. 

Recombinant DNA constructs having DNA sequences encoding the above-mentioned chimeric proteins are
provided. Cloning vectors incorporating the above DNA constructs and cells transformed with the vectors are also provided.
Therapeutic compositions incorporating the above-mentioned chimeric protein(s) and pharmaceutically acceptable
vehicles are provided. For example, a drug delivery composition is provided which has a chimeric protein having a domain
derived from a fibrous protein and a domain derived from a physiologically active glycoprotein, protein, peptide and/or
proteoglycan.

Methods for preparing a DNA construct including a DNA sequence encoding a cell regulatory factor (such as an
osteogenic agent, a transforming growth factor, and/or a dermatan sulfate proteoglycan) operably linked to a DNA
sequence encoding an extracellular matrix protein are provided. Also provided are methods of manufacturing
osteogenic/extracellular matrix, transforming growth factor/extracellular matrix, and/or dermatan sulfate
proteoglycan/extracellular matrix chimeric proteins by transforming a cell with a suitable cloning vector including a DNA construct encoding
the osteogenic/extracellular matrix chimeric protein, the transforming growth factor/extracellular matrix chimeric protein,
or the dermatan sulfate proteoglycan/extracellular matrix chimeric protein, respectively, culturing the cell in a suitable
culture medium and isolating the chimeric protein from the culture medium.

In other embodiments, methods for inducing formation of bone, promoting soft tissue repair, and reducing scar formation
are provided which involve contacting a suitable locus with an osteogenic chimeric protein, a soft tissue chimeric protein,
or an anti-scarring chimeric protein, respectively. Suitable osteogenic chimeric proteins have a domain derived from one
or more osteogenic agents and a domain derived from one or more extracellular matrix proteins. Suitable soft tissue chimeric
proteins have a domain derived from at least one transforming growth factor and a domain derived from one or more extracellular
matrix proteins. Suitable anti-scarring chimeric proteins have a domain derived from dermatan sulfate proteoglycan and
a domain derived from one or more extracellular matrix proteins. Further provided are methods for inducing bone
formation, soft tissue repair, and reducing scar formation by contacting the osteogenic chimeric protein, the soft tissue
chimeric protein, or the anti-scarring chimeric protein, respectively, with an implant at a suitable locus in viable tissue.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 depicts a nucleic acid sequence which encodes a BMP2B/collagen IA protein construct.

Fig. 2 depicts a nucleic acid sequence which encodes a transforming growth factor β/collagen IA protein construct.

Fig. 3 depicts a nucleic acid sequence which encodes a dermatan sulfate proteoglycan/collagen IA protein construct.

Fig. 4 depicts a nucleic acid sequence which encodes a dermatan sulfate proteoglycan peptide/collagen IA protein
construct.

Fig. 5 depicts an amino acid sequence for a BMP2B/collagen IA chimeric protein.

Fig. 6 depicts an amino acid sequence for a TGF-β/collagen IA chimeric protein.

Fig. 7 depicts an amino acid sequence for a dermatan sulfate proteoglycan/collagen IA chimeric protein.

Fig. 8 depicts an amino acid sequence for a dermatan sulfate proteoglycan peptide/collagen IA chimeric protein.

Fig. 9 depicts a pMal cloning vector containing a polylinker cloning site.

Fig. 10 depicts a polylinker cloning site contained in a pMal cloning vector of Fig. 9.

Fig. 11 depicts a pMal cloning vector containing a BMP2B/collagen IA DNA construct.

Fig. 12 depicts a pMal cloning vector containing a TGF-β/collagen IA DNA construct.

Fig. 13 depicts a pMal cloning vector containing a decorin/collagen IA DNA construct.

Fig. 14 depicts a pMal cloning vector containing a decorin peptide/collagen IA DNA construct.

DETAILED DESCRIPTION 

Chimeric proteins provide an integrated combination of a therapeutically active domain containing one or more
therapeutically active moieties and an extracellular matrix protein domain containing one or more EMP moieties. The EMP
domain provides an integral vehicle for delivery of the therapeutically active moiety to a target site. The two domains are
linked covalently by one or more peptide bonds contained in a linker region. As used herein, integrated or integral means
characteristics which result from the covalent association of one or more domains of the inventive chimeric proteins.
The therapeutically active moieties disclosed herein are typically made of amino acids linked to form peptides, proteins,
glycoproteins or proteoglycans.

The inherent characteristics of EMPs are ideal for use as a vehicle for the therapeutic moiety. Examples of suitable EMPs
are collagen, elastin, fibronectin, and fibrin. Fibrillar collagens (Types I, II and III) assemble into ordered polymers and
often aggregate into larger bundles. Type IV collagen assembles into sheetlike meshworks. Elastin molecules form
filaments and sheets in which the elastin molecules are highly cross-linked to one another and which provide good elasticity
and high tensile strength. The cross-linked, random-coiled structure of the fiber network allows it to stretch and recoil
like a rubber band. Fibronectin is a large fibril-forming glycoprotein which, in one of its forms, consists of highly insoluble
fibrils cross-linked to each other by disulfide bonds. Fibrin is an insoluble protein formed from fibrinogen by the proteolytic
activity of thrombin during the normal clotting of blood.

The molecular and macromolecular morphology of the above EMPs defines networks or matrices to provide
substratum or scaffolding in integral covalent association with the therapeutically active moiety. The networks or matrices
formed by the EMP domain provide an environment particularly well suited for ingrowth of autologous cells involved in
growth, repair and replacement of existing tissue. The integral therapeutically active moieties covalently bound within
the networks or matrices provide maximum exposure of the active agents to their targets to elicit a desired response.

Implants formed of or from the present chimeric proteins provide sustained release activity in or at a desired locus
or target site. Unlike the above-described compositions discussed in the Background which incorporate a vehicle not
covalently linked to an EMP, the therapeutically active domain of the present chimeric protein is not free to separately
diffuse or otherwise be transported away from the vehicle which carries it, absent cleavage of peptide bonds.
Consequently, chimeric proteins provide an effective anchor for therapeutic activity which allows the activity to be confined to a
target location for a prolonged duration. Because the supply of therapeutically active agent does not have to be
replenished as often, smaller amounts of therapeutically active agent may be used over the course of therapy. Consequently,
certain advantages provided by the inventive chimeric proteins are a decrease or elimination of local and systemic side
effects, less potentiation or reduction in therapeutic activity with chronic use, and minimization of drug accumulation in
body tissues with chronic dosing.

Use of recombinant technology allows manufacturing of nonimmunogenic chimeric proteins. The DNA encoding 
both the therapeutically active moiety and EMP moiety should preferably be derived from the same species as the patient 
being treated to avoid an immunogenic reaction. For example, if the patient is human, the therapeutically active moiety 
as well as the EMP moiety is preferably derived from human DNA. 

Osteogenic/EMP chimeric proteins provide biodegradable and biocompatible agents for inducing bone formation at
a desired site. In one embodiment a BMP moiety is covalently linked with an EMP to form a chimeric protein. The BMP
moiety induces osteogenesis and the extracellular matrix protein moiety provides an integral substratum or scaffolding
for the BMP moiety and cells which are involved in reconstruction and growth. Compositions containing the BMP/EMP
chimeric protein provide effective sustained release delivery of the BMP moiety to desired target sites. The method of
manufacturing such an osteogenic agent is efficient because the need for extra time-consuming steps, such as purifying
EMP and then admixing it with the purified BMP, is eliminated. An added advantage of the BMP/EMP chimeric protein
results from the stability created by the covalent bond between BMP and the EMP, i.e., the BMP portion is not free to
separately diffuse away from the EMP, thus providing a more stable therapeutic agent.
Bone morphogenic proteins are a class identified as BMP-1 through BMP-9. A preferred osteogenic protein for use
in human patients is human BMP-2B. A BMP-2B/collagen IA chimeric protein is illustrated in Fig. 5. The protein sequence
illustrated in Fig. 5 includes a collagen helical domain depicted at amino acids 1-1057 and a mature form of BMP2B at
amino acids 1060-1169. The physical properties of the chimeric protein are dominated in part by the EMP component.
In the case of a collagen moiety, a concentrated solution of chimeric protein will have a gelatinous consistency that
allows easy handling by the medical practitioner. The EMP moiety acts as a sequestering agent to prevent rapid
desorption of the BMP moiety from the desired site and provide sustained release of BMP activity. As a result, the BMP
moiety remains at the desired site for a period of time necessary to effectively induce bone formation. The EMP moiety
also provides a matrix which allows a patient's autologous cells, e.g., chondrocytes and the like, which are normally
involved in osteogenesis to collect therein and form an autologous network for new tissue growth. The gelatinous
consistency of the chimeric protein also provides a useful and convenient therapeutic manner for immobilizing active BMP
on a suitable vehicle or implant for delivering the BMP moiety to a site where bone growth is desired.

The BMP moiety and the EMP moiety are optionally linked together by linker sequences of amino acids. Examples
of linker sequences used are illustrated within the sequences depicted in Figs. 1-4 and described in more detail below.
Linker sequences may be chosen based on particular properties which they impart to the chimeric protein. For example,
amino acid sequences such as Ile-Glu-Gly-Arg and Leu-Val-Pro-Arg are cleaved by Factor Xa and Thrombin enzymes,
respectively. Incorporating sequences which are cleaved by proteolytic enzymes into chimeric proteins provides cleavage
at the linker site upon exposure to the appropriate enzyme and separation of the two domains into separate entities. It
is contemplated that numerous linker sequences can be incorporated into any of the chimeric proteins.
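
The effect of a protease-cleavable linker can be illustrated with a minimal sketch. The two recognition motifs (Ile-Glu-Gly-Arg and Leu-Val-Pro-Arg, written below in one-letter code) are taken from the passage above; the toy sequence, the helper name `cleave_at_linker`, and the simplifying assumption that cleavage occurs immediately after the final Arg of the motif are illustrative only and are not part of the disclosure.

```python
# Illustrative sketch only: the motifs come from the text, everything else is hypothetical.
CLEAVAGE_MOTIFS = {
    "Factor Xa": "IEGR",  # Ile-Glu-Gly-Arg
    "Thrombin": "LVPR",   # Leu-Val-Pro-Arg
}


def cleave_at_linker(protein: str, enzyme: str) -> list[str]:
    """Split a one-letter protein sequence after every occurrence of the enzyme's motif.

    Simplifying assumption: the cut falls immediately after the final Arg of the motif.
    """
    motif = CLEAVAGE_MOTIFS[enzyme]
    fragments = []
    start = 0
    pos = protein.find(motif)
    while pos != -1:
        cut = pos + len(motif)
        fragments.append(protein[start:cut])
        start = cut
        pos = protein.find(motif, cut)
    if start < len(protein):
        fragments.append(protein[start:])
    return fragments


# Toy chimera: a collagen-like Gly-Pro-Pro repeat, a Factor Xa linker, and a made-up payload.
toy_chimera = "GPPGPPGPP" + "IEGR" + "QAKHKQ"
print(cleave_at_linker(toy_chimera, "Factor Xa"))  # two fragments
print(cleave_at_linker(toy_chimera, "Thrombin"))   # no thrombin motif, so one fragment
```

In this sketch the EMP-like portion retains the linker residues after cleavage, while the payload is released as a separate fragment.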

In another embodiment, a chimeric DNA construct includes a gene encoding an osteogenic protein or a fragment
thereof linked to a gene encoding an EMP or a fragment thereof. The gene sequences for various BMPs are known,
see, e.g., U.S. Patent Nos. 4,294,753, 4,761,471, 5,106,748, 5,187,076, 5,141,905, 5,108,922, 5,166,058, 5,116,738
and 5,168,050, each incorporated herein by reference. A BMP-2B gene for use with this invention is synthesized by
ligating oligonucleotides encoding a BMP protein. The oligonucleotides encoding BMP-2B are synthesized using an
automated DNA synthesizer (Beckman Oligo-1000). In a preferred embodiment, the nucleotide sequence encoding the
BMP is optimized for expression in E. coli. This is accomplished by using E. coli codon utilization tables to translate the
sequence of amino acids of the BMP into codons that are utilized most often by E. coli. Alternatively, native DNA encoding
BMP isolated from mammals including humans may be purified and used.
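
The back-translation step described above, in which each amino acid is replaced by a codon that E. coli uses frequently, can be sketched as follows. The `PREFERRED_CODON` table is an assumed, illustrative choice of one commonly cited high-usage codon per amino acid; it is not taken from the specification, and a real design would draw on a published E. coli codon usage table. The peptide used is a toy example, not a BMP sequence.

```python
# Hypothetical preferred-codon table for E. coli (illustration only, one codon per residue).
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}
STOP_CODON = "TAA"

# Reverse lookup, valid only for DNA produced by back_translate below.
CODON_TO_AA = {codon: aa for aa, codon in PREFERRED_CODON.items()}


def back_translate(peptide: str, add_stop: bool = True) -> str:
    """Convert a one-letter peptide into DNA using one preferred codon per residue."""
    codons = [PREFERRED_CODON[aa] for aa in peptide.upper()]
    if add_stop:
        codons.append(STOP_CODON)
    return "".join(codons)


def translate(dna: str) -> str:
    """Translate DNA made by back_translate, stopping at the stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if codon == STOP_CODON:
            break
        residues.append(CODON_TO_AA[codon])
    return "".join(residues)


toy_peptide = "MKLV"  # toy input, not a BMP sequence
dna = back_translate(toy_peptide)
print(dna)
print(translate(dna) == toy_peptide)  # round-trip check
```

Choosing a single codon per residue keeps the sketch short; practical designs often also avoid unwanted restriction sites and long repeats, which this sketch does not attempt.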

The BMP gene and the DNA sequence encoding an extracellular matrix protein are cloned by standard genetic
engineering methods as described in Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 1982,
hereby incorporated by reference.

The DNA sequence corresponding to the helical region of collagen I(α) is cloned from a human fibroblast cell line.
Two sets of polymerase chain reactions are carried out using cDNA prepared by standard methods from AG02261A
cells. The first pair of PCR primers includes a 5' primer bearing an XmnI linker sequence and a 3' primer bearing the
BsmI site at nucleotide number 1722. The resulting PCR product consists of sequence from position 1 to 1722. The
second pair of primers includes the BsmI site at 1722 and a linker sequence at the 3' end bearing a BglII site. The
resulting PCR product consists of sequence from position 1722 to 3196. The complete helical sequence is assembled
by standard cloning techniques. The two PCR products are ligated together at the BsmI site, and the combined clone
is inserted into any vector with XmnI-BglII sites or XmnI-BamHI sites, such as the pMal-c2 vector.
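
The statement that either a BglII or a BamHI site in the vector can receive the BglII end of the insert follows from the well-known fact that the two enzymes leave identical cohesive ends. The sketch below works this out from the standard recognition sequences and cut positions (XmnI GAANN^NNTTC, BglII A^GATCT, BamHI G^GATCC, HindIII A^AGCTT); the data structure and function names are hypothetical, and the overhang calculation assumes palindromic sites.

```python
from typing import NamedTuple


class Enzyme(NamedTuple):
    site: str  # recognition sequence, 5'->3', N = any base
    cut: int   # top-strand cut position, counted from the 5' end of the site


ENZYMES = {
    "XmnI": Enzyme("GAANNNNTTC", 5),
    "BglII": Enzyme("AGATCT", 1),
    "BamHI": Enzyme("GGATCC", 1),
    "HindIII": Enzyme("AAGCTT", 1),
}


def overhang(name: str) -> tuple[str, str]:
    """Return (kind, bases) of the end left by a palindromic site."""
    site, cut = ENZYMES[name]
    n = len(site)
    if cut * 2 == n:
        return ("blunt", "")
    if cut * 2 < n:
        return ("5'", site[cut:n - cut])
    return ("3'", site[n - cut:cut])


def compatible(a: str, b: str) -> bool:
    """Two ends can be ligated if they are both blunt or carry the same overhang."""
    return overhang(a) == overhang(b)


def hybrid_junction(left: str, right: str) -> str:
    """Sequence formed when a left-cut end is ligated to a right-cut end (5' overhangs only)."""
    kind, bases = overhang(left)
    if kind != "5'" or overhang(right) != (kind, bases):
        raise ValueError("ends are not compatible 5' overhangs")
    l, r = ENZYMES[left], ENZYMES[right]
    return l.site[:l.cut] + bases + r.site[len(r.site) - r.cut:]


print(overhang("BglII"), overhang("BamHI"))  # both leave a 5' GATC overhang
print(compatible("BglII", "BamHI"))          # True
print(compatible("BglII", "HindIII"))        # False
print(overhang("XmnI"))                      # blunt end
print(hybrid_junction("BglII", "BamHI"))     # junction matches neither original site
```

The hybrid junction produced in the last line is no longer a recognition site for either enzyme, which may be why the construct description refers to a combined BglII/BamHI linker; that reading is an inference and not something the text states.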

To clone the BMP-2B gene, total cellular RNA is isolated from human osteosarcoma cells (U-2OS) by the method
described by Robert E. Farrell Jr. (Academic Press, CA, 1993, pp. 68-69) (herein incorporated by reference). The integrity
of the RNA is verified by spectrophotometric analysis and electrophoresis through agarose gels. Typical yields of total
RNA are 50 µg from a 100 mm confluent tissue culture dish. The RNA is used to generate cDNA by reverse transcription
using the Superscript preamplification system by Gibco BRL. The cDNA is used as template for PCR amplification using
upstream and downstream primers specific for BMP-2B (GenBank HUMBMP2B accession # M22490). The resulting
PCR product consists of BMP-2B sequence from position 1289-1619. The PCR product is resolved by electrophoresis
through agarose gels, purified with Geneclean (BIO 101) and ligated into pMal-c2 vector (New England Biolabs). The
helical domain of human collagen I(α) chain is cloned in a similar manner. However, the total cellular RNA is isolated
from a human fibroblast cell line (AG02261A human skin fibroblasts).

A chimeric BMP/EMP DNA construct is obtained by ligating a synthetic BMP gene to a DNA sequence encoding
an EMP such as collagen, fibrin, fibronectin, elastin or laminin. However, the invention is not limited to these particular
proteins. Fig. 1 illustrates a DNA construct which encodes a BMP-2B/collagen IA chimeric protein. The coding sequence
for an EMP may be ligated upstream and/or downstream and in-frame with a coding sequence for the BMP. The DNA
encoding an EMP may be a portion of the gene or an entire EMP gene. Furthermore, two different EMPs may be ligated
upstream and downstream from the BMP.

The DNA construct illustrated in Fig. 1, which encodes the BMP-2B/collagen IA chimeric protein, includes an XmnI
linker sequence at base pairs (bp) 1-19, a collagen helical domain (bp 20-3190), a BglII/BamHI linker sequence
(bp 3191-3196), a mature form of BMP-2B (bp 3197-3529) and a HindIII linker sequence (bp 3530-3535).
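
As a consistency check on the coordinates listed above, the following sketch verifies that the segments are contiguous, that the BMP-2B coding segment is in frame with the collagen segment, and how many codons each segment holds. The base-pair ranges are those stated for Fig. 1; the `Segment` type and helper functions are hypothetical.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


# Coordinates as stated in the text for the Fig. 1 construct.
FIG1_SEGMENTS = [
    Segment("XmnI linker", 1, 19),
    Segment("collagen helical domain", 20, 3190),
    Segment("BglII/BamHI linker", 3191, 3196),
    Segment("mature BMP-2B", 3197, 3529),
    Segment("HindIII linker", 3530, 3535),
]


def check_contiguous(segments: list[Segment]) -> None:
    """Raise if any segment does not start one base after the previous one ends."""
    for prev, nxt in zip(segments, segments[1:]):
        if nxt.start != prev.end + 1:
            raise ValueError(f"gap or overlap between {prev.name} and {nxt.name}")


def codon_count(seg: Segment) -> int:
    """Number of whole codons in a segment; raise if its length is not a multiple of 3."""
    length = seg.end - seg.start + 1
    codons, remainder = divmod(length, 3)
    if remainder:
        raise ValueError(f"{seg.name} is not a whole number of codons")
    return codons


def same_frame(a: Segment, b: Segment) -> bool:
    """True if segment b starts in the reading frame set by the start of segment a."""
    return (b.start - a.start) % 3 == 0


check_contiguous(FIG1_SEGMENTS)
collagen, linker, bmp = FIG1_SEGMENTS[1], FIG1_SEGMENTS[2], FIG1_SEGMENTS[3]
print(codon_count(collagen), codon_count(linker), codon_count(bmp))
print(same_frame(collagen, bmp))
```

The collagen segment holds 1057 codons and the linker 2, which agrees with the collagen helical domain at amino acids 1-1057 and BMP-2B beginning at amino acid 1060 in Fig. 5. The BMP-2B segment holds 111 codons against the 110 residues at positions 1060-1169; the extra codon may be a stop codon, although the text does not say so.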
Any combination of growth factor and matrix protein sequences is contemplated, including repeating units or
multiple arrays of each segment in any order. Incorporation of fragments of both matrix and growth factor proteins is also
contemplated. For example, in the case of collagen, only the helical domain may be included. Other matrix proteins have
defined domains, such as laminin, which has EGF-like domains. In these cases, specific functionalities can be chosen
to achieve desired effects. Moreover, it may be useful to combine domains from disparate matrix proteins, such as the
helical region of collagen and the cell attachment regions of fibronectin. In the case of growth factors, specific segments
have been shown to be removed from the mature protein by post-translational processing. Chimeric proteins can be
designed to include only the mature biologically active region. For example, in the case of BMP-2B only the final 110
amino acids are found in the active protein.

In another embodiment a transforming growth factor (TGF) moiety is covalently linked with an EMP to form a chimeric
protein. The TGF moiety increases efficacy of the body's normal soft tissue repair response and also induces
osteogenesis. Consequently, TGF/EMP chimeric proteins may be used for either or both functions. One of the fundamental
properties of the TGF-βs is their ability to turn on various activities that result in the synthesis of new connective tissue.
See, Piez and Sporn eds., Transforming Growth Factor-βs: Chemistry, Biology and Therapeutics, Annals of the New York
Academy of Sciences, Vol. 593, (1990). TGF-β is known to exist in at least five different isoforms. The DNA sequence
for Human TGF-β1 is known and has been cloned. See Derynck et al., Human Transforming Growth Factor-Beta cDNA
Sequence and Expression in Tumour Cell Lines, Nature, Vol. 316, pp. 701-705 (1985), herein incorporated by reference.
TGF-β2 has been isolated from bovine bone, human glioblastoma cells and porcine platelets. TGF-β3 has also been
cloned. See ten Dijke, et al., Identification of a New Member of the Transforming Growth Factor-β Gene Family, Proc.
Natl. Acad. Sci. (USA), Vol. 85, pp. 4715-4719 (1988) herein incorporated by reference.

A TGF-β/EMP chimeric protein incorporates the known activities of TGF-βs and provides integral scaffolding or
substratum of the EMP as described above to yield a composition which further provides sustained release focal delivery
at target sites.

The TGF-β moiety and the EMP moiety are optionally linked together by linker sequences of amino acids. Linker
sequences may be chosen based upon particular properties which they impart to the chimeric protein. For example,
amino acid sequences such as Ile-Glu-Gly-Arg and Leu-Val-Pro-Arg are cleaved by Factor Xa and Thrombin enzymes,
respectively. Incorporating sequences which are cleaved by proteolytic enzymes into the chimeric protein provides
cleavage at the linker site upon exposure to the appropriate enzyme and separation of the domains into separate entities.
Fig. 6 depicts an amino acid sequence for a TGF-β1/collagen IA chimeric protein. The illustrated amino acid sequence
includes the collagen helical domain (1-1057) and a mature form of TGF-β1 (1060-1171).

A chimeric DNA construct includes a gene encoding TGF-β1 or a fragment thereof, or a gene encoding TGF-β2 or
a fragment thereof, or a gene encoding TGF-β3 or a fragment thereof, ligated to a DNA sequence encoding an EMP
protein such as collagen (I-IV), fibrin, fibronectin, elastin or laminin. A preferred chimeric DNA construct combines DNA
encoding TGF-β1, a DNA linker sequence, and DNA encoding collagen IA. A chimeric DNA construct containing a
TGF-β1 gene and a collagen IA gene is shown in Fig. 2. The illustrated construct includes an XmnI linker sequence (bp 1-19),
DNA encoding a collagen helical domain (bp 20-3190), a BglII linker sequence (bp 3191-3196), DNA encoding a mature
form of TGF-β1 (bp 3197-3535), and an XbaI linker sequence (bp 3536-3541).

The coding sequence for EMP may be ligated upstream and/or downstream and in-frame with a coding sequence
for the TGF-β. The DNA encoding the extracellular matrix protein may encode a portion or fragment of the EMP or may
encode the entire EMP. Likewise, the DNA encoding the TGF-β may be one or more fragments thereof or the entire
gene. Furthermore, two or more different TGF-βs or two or more different EMPs may be ligated upstream or downstream
of alternate moieties.

In yet another embodiment, a dermatan sulfate proteoglycan moiety, also known as decorin or proteoglycan II, is
covalently linked with an EMP to form a chimeric protein. Decorin is known to bind to type I collagen and thus affect fibril
formation, and to inhibit the cell attachment promoting activity of collagen and fibrinogen by binding to such molecules
near their cell binding sites. Chimeric proteins which contain a decorin moiety act to reduce scarring of healing tissue.
The primary structure of the core protein of decorin has been deduced from cloned cDNA. See Krusius et al., Primary
Structure of an Extracellular Matrix Proteoglycan Core Protein Deduced from Cloned cDNA, Proc. Natl. Acad. Sci. (USA),
Vol. 83, pp. 7683-7687 (1986) incorporated herein by reference.

A decorin/EMP chimeric protein incorporates the known activities of decorin and provides integral scaffolding or
substratum of the EMP as described above to yield a composition which allows sustained release focal delivery to target
sites. Fig. 7 illustrates a decorin/collagen IA chimeric protein in which the collagen helical domain includes amino acids
1-1057 and the TGF-β mature protein includes amino acids 1060-1171. Fig. 8 illustrates a decorin peptide/collagen IA
chimeric protein in which the collagen helical domain includes amino acids 1-1057 and the decorin peptide fragment
includes amino acids 1060-1107. The decorin peptide fragment is composed of P46 to G93 of the mature form of decorin.
Further provided is a chimeric DNA construct which includes a gene encoding decorin or one or more fragments 
thereof, optionally ligated via a DNA linker sequence to a DNA sequence encoding an EMP such as collagen (l-IV), 
fibrin, f ibronectin, elastin or laminin. A preferred chimeric DNA construct combines DNA encoding decorin, a DNA linker 
sequence, and DNA encoding collagen IA. A chimeric DNA construct containing a decorin gene and a collagen IA gene 
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is shown in Fig. 3. The illustrated construct includes an Xmnl linker sequence (bp 1 -1 9), DNA encoding a collagen helical 
domain (bp 20-3190), a Bglll linker sequence (bp 3191-3196), DNA encoding a mature form of decorin (bp 3197-4186) 
and a Pstl linker sequence. A chimeric DNA construct containing a decorin peptide gene and a collagen IA gene is 
shown in Fig. 4. The illustrated construct includes an Xmnl linker sequence (bp 1- 19), DNA encoding a collagen helical 

5 domain (bp 20-3190), a Bglll linker sequence (bp 3191-3196), DNA encoding a peptide fragment of decorin (bp 3197- 
3343). and a Pstl linker sequence (bp 3344-3349). 
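The base-pair coordinates given for the Fig. 4 construct can be checked arithmetically. The following is a minimal sketch, not part of the disclosed method: the `Segment` class and the helper names are hypothetical, and the assumption that translation begins at bp 20 (the first base of the collagen helical domain) is made only for illustration. It confirms that the listed segments abut one another and that each segment from bp 20 onward spans a whole number of codons, so that the downstream segments remain in frame.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """A named stretch of the construct (hypothetical helper type)."""
    name: str
    start: int  # first base pair, 1-based, inclusive
    end: int    # last base pair, 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


# Coordinates as stated in the text for the construct of Fig. 4.
FIG4_CONSTRUCT = [
    Segment("XmnI linker", 1, 19),
    Segment("collagen helical domain", 20, 3190),
    Segment("BglII linker", 3191, 3196),
    Segment("decorin peptide fragment", 3197, 3343),
    Segment("PstI linker", 3344, 3349),
]


def is_contiguous(segments):
    """True if every segment starts one base after the previous one ends."""
    return all(b.start == a.end + 1 for a, b in zip(segments, segments[1:]))


def in_frame(segments, first_coding_bp):
    """True if every segment starting at or after first_coding_bp
    is a whole number of codons long (assumed reading-frame origin)."""
    return all(s.length % 3 == 0 for s in segments if s.start >= first_coding_bp)


for seg in FIG4_CONSTRUCT:
    print(f"{seg.name}: {seg.length} bp")
print("contiguous:", is_contiguous(FIG4_CONSTRUCT))
print("in frame from bp 20:", in_frame(FIG4_CONSTRUCT, 20))
```

Under these assumptions the collagen helical domain spans 3171 bp, i.e. 1057 codons, which agrees with amino acids 1-1057 given for Figs. 7 and 8.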

The coding sequence for an EMP may be ligated upstream and/or downstream and in-frame with a coding sequence
for decorin. The DNA encoding the EMP may encode a portion or fragment of the EMP or may encode the entire EMP.
Likewise, the DNA encoding decorin may be a fragment thereof or the entire gene. Furthermore, two or more different
EMPs may be ligated upstream from the DNA encoding decorin moiety.

Any of the above described chimeric DNA constructs may be incorporated into a suitable cloning vector. Fig. 9
depicts a pMal cloning vector containing a polylinker cloning site. Preferred cloning vectors are the plasmids pMal-p2
and pMal-c2 (commercially available from New England Biolabs). The desired chimeric DNA construct is incorporated
into the polylinker sequence of the plasmid. The pMal-p2 polylinker sequence has XmnI, EcoRI, BamHI, HindIII, XbaI,
SalI and PstI restriction endonuclease sites, which are depicted in Fig. 10. The polylinker sequence is digested with an
appropriate restriction endonuclease and the chimeric construct is incorporated into the cloning vector by ligating it to
the DNA sequences of the plasmid. The chimeric DNA construct may be joined to the plasmid by digesting the ends of
the DNA construct and the plasmid with the same restriction endonuclease to generate "sticky ends" having 5' phosphate
and 3' hydroxyl groups which allow the DNA construct to anneal to the cloning vector. Gaps between the inserted DNA
construct and the plasmid are then sealed with DNA ligase. Other techniques for incorporating the DNA construct into
plasmid DNA include blunt end ligation, poly(dA-dT) tailing techniques, and the use of chemically synthesized linkers.
An alternative method for introducing the chimeric DNA construct into a cloning vector is to incorporate the DNA
encoding the extracellular matrix protein into a cloning vector already containing a gene encoding a therapeutically
active moiety.

The cloning sites in the above-identified polylinker site allow the cDNA for the collagen IA/BMP-2B chimeric protein
illustrated in Fig. 1 to be inserted between the XmnI and the HindIII sites. The cDNA encoding the collagen IA/TGF-β1
protein illustrated in Fig. 2 is inserted between the XmnI and the XbaI sites. The cDNA encoding the collagen IA/decorin
protein illustrated in Fig. 3 is inserted between the XmnI and the PstI sites. The cDNA encoding the collagen IA/decorin
peptide (dec 1) illustrated in Fig. 4 is inserted between the XmnI and PstI sites.

Plasmids containing the chimeric DNA construct are identified by standard techniques such as gel electrophoresis.
Procedures and materials for preparation of recombinant vectors, transformation of host cells with the vectors, and host
cell expression of polypeptides are described in Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor 1982, hereby incorporated by reference. Generally, prokaryotic or eukaryotic host cells may be transformed with
the recombinant DNA plasmids. Transformed host cells may be located through phenotypic selection genes of the cloning
vector which provide resistance to a particular antibiotic when the host cells are grown in a culture medium containing
that antibiotic.

Transformed host cells are isolated and cultured to promote expression of the chimeric protein. The chimeric protein
may then be isolated from the culture medium and purified by various methods such as dialysis, density gradient
centrifugation, liquid column chromatography, isoelectric precipitation, solvent fractionation, and electrophoresis. However,
purification of the chimeric protein by affinity chromatography is preferred whereby the chimeric protein is purified by
ligating it to a binding protein and contacting it with a ligand or substrate to which the binding protein has a specific affinity.

In order to obtain more effective expression of mammalian or human eukaryotic genes in bacteria (prokaryotes),
the mammalian or human gene should be placed under the control of a bacterial promoter. A protein fusion and
purification system is employed to obtain the chimeric protein. Preferably, any of the above-described chimeric DNA constructs
is cloned into a pMal vector at a site in the vector's polylinker sequence. As a result, the chimeric DNA construct is
operably fused with the malE gene of the pMal vector. The malE gene encodes maltose binding protein (MBP). Fig. 11
depicts a pMal cloning vector containing a BMP/collagen DNA construct. A spacer sequence coding for 10 asparagine
residues is located between the malE sequence and the polylinker sequence. This spacer sequence insulates MBP
from the protein of interest. Figs. 12, 13 and 14 depict pMal cloning vectors containing DNA encoding TGF-β1, decorin
and a decorin peptide, respectively. The pMal vector containing any of the chimeric DNA constructs fused to the malE
gene is transformed into E. coli. This technique utilizes the Ptac promoter of the malE gene.

The E. coli is cultured in a medium which induces the bacteria to produce the maltose binding protein fused to the
chimeric protein. The MBP contains a 26 amino acid N-terminal signal sequence which directs the MBP-chimeric protein
through the E. coli cytoplasmic membrane. The protein can then be purified from the periplasm. Alternatively, the
pMal-c2 cloning vector can be used with this protein fusion and purification system. The pMal-c2 vector contains an exact
deletion of the malE signal sequence which results in cytoplasmic expression of the fusion protein. A crude cell extract
containing the fusion protein is prepared and poured over a column of amylose resin. Since MBP has an affinity for the
amylose it binds to the resin. Alternatively, the column can include any substrate for which MBP has a specific affinity.
Unwanted proteins present in the crude extract are washed through the column. The MBP fused to the chimeric protein
is eluted from the column with a neutral buffer containing maltose or other dilute solution of a desorbing agent for
displacing the hybrid polypeptide. The purified MBP-chimeric protein is cleaved with a protease such as factor Xa protease
to cleave the MBP from the chimeric protein. The pMal-p2 plasmid has a sequence encoding the recognition site for
protease factor Xa which cleaves after the amino acid sequence Isoleucine-Glutamic acid-Glycine-Arginine of the
polylinker sequence.
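The cleavage rule stated above, in which factor Xa cuts after the sequence Isoleucine-Glutamic acid-Glycine-Arginine (IEGR in one-letter code), can be illustrated with a short sketch. The function name and the toy fusion sequence below are hypothetical and are not taken from the disclosure; the sketch models only the stated recognition site and ignores any other specificity the protease may have.

```python
FACTOR_XA_SITE = "IEGR"  # Ile-Glu-Gly-Arg, one-letter amino acid code


def cleave_after_site(protein, site=FACTOR_XA_SITE):
    """Split a protein sequence immediately after each occurrence of site.

    Returns the fragments in N-terminal to C-terminal order.
    """
    fragments = []
    start = 0
    pos = protein.find(site)
    while pos != -1:
        cut = pos + len(site)          # cleavage occurs after the last residue of the site
        fragments.append(protein[start:cut])
        start = cut
        pos = protein.find(site, cut)
    if start < len(protein):
        fragments.append(protein[start:])
    return fragments


# Toy fusion: an invented carrier stub, ten asparagines, the site, then an invented passenger.
toy_fusion = "MKTEEGKL" + "N" * 10 + "IEGR" + "GPMGPSGPRG"
carrier, passenger = cleave_after_site(toy_fusion)
print(carrier)
print(passenger)
```

In this toy case the first fragment retains the recognition site at its C-terminus and the second fragment corresponds to the passenger protein alone.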

The chimeric protein is then separated from the cleaved MBP by passing the mixture over an amylose column. An
alternative method for separating the MBP from the chimeric protein is by ion exchange chromatography. This system
yields up to 100 mg of MBP-chimeric protein per liter of culture. See Riggs, P., in Ausubel, F.M., Kingston, R.E., Moore,
D.D., Seidman, J.G., Smith, J.A., Struhl, K. (eds.) Current Protocols in Molecular Biology, Supplement 19 (16.6.1-16.6.10)
(1990) Greene Associates/Wiley Interscience, New York; New England Biolabs (cat # 800-65S 9pMALc2) pMal protein
fusion and purification system, hereby incorporated by reference. (See also European Patent No. 286 239, herein
incorporated by reference, which discloses a similar method for production and purification of a protein such as collagen.)

Other protein fusion and purification systems may be employed to produce chimeric proteins. Prokaryotes such as
E. coli are the preferred host cells for expression of the chimeric protein. However, systems which utilize eukaryote host
cell lines are also acceptable, such as yeast, human, mouse, rat, hamster, monkey, amphibian, insect, algae, and plant
cell lines. For example, HeLa (human epithelial), 3T3 (mouse fibroblast), CHO (Chinese hamster ovary), and SP2
(mouse plasma cell) are acceptable cell lines. The particular host cells that are chosen should be compatible with the
particular cloning vector that is chosen.

Another acceptable protein expression system is the Baculovirus Expression System manufactured by Invitrogen
of San Diego, California. Baculoviruses form prominent crystal occlusions within the nuclei of cells they infect. Each
crystal occlusion consists of numerous virus particles enveloped in a protein called polyhedrin. In the baculovirus
expression system, the native gene encoding polyhedrin is substituted with a DNA construct encoding a protein or peptide
having a desired activity. The virus then produces large amounts of protein encoded by the foreign DNA construct. The
preferred cloning vector for use with this system is pBlueBac III (obtained from Invitrogen of San Diego, California). The
baculovirus system utilizes the Autographa californica multiple nuclear polyhedrosis virus (AcMNPV) regulated polyhedrin
promoter to drive expression of foreign genes. AcMNPV is isolated from a moth called the California looper. The chimeric
gene, i.e., the DNA construct encoding the chimeric protein, is inserted into the pBlueBac III vector immediately
downstream from the baculovirus polyhedrin promoter.

The pBlueBac III transfer vector contains a β-galactosidase reporter gene which allows for identification of
recombinant virus. The β-galactosidase gene is driven by the baculovirus ETL promoter (PETL) which is positioned in opposite
orientation to the polyhedrin promoter (PPH) and the multiple cloning site of the vector. Therefore, recombinant virus
coexpresses β-galactosidase and the chimeric gene.

Spodoptera frugiperda (Sf9) insect cells are then cotransfected with wild type viral DNA and the pBlueBac III vector
containing the chimeric gene. Recombination sequences in the pBlueBac III vector direct the vector's integration into
the genome of the wild type baculovirus. Homologous recombination occurs resulting in replacement of the native
polyhedrin gene of the baculovirus with the DNA construct encoding the chimeric protein. Wild type baculovirus which do
not contain foreign DNA express the polyhedrin protein in the nuclei of the infected insect cells. However, the
recombinants do not produce polyhedrin protein and do not produce viral occlusions. Instead, the recombinants produce the
chimeric protein.

Alternative insect host cells for use with this expression system are the Sf21 cell line derived from Spodoptera frugiperda
and High Five cell lines derived from Trichoplusia ni.

Other acceptable cloning vectors include phages, cosmids or artificial chromosomes. For example, bacteriophage
lambda is a useful cloning vector. This phage can accept pieces of foreign DNA up to about 20,000 base pairs in length.
The lambda phage genome is a linear double stranded DNA molecule with single stranded complementary (cohesive)
ends which can hybridize with each other when inside an infected host cell. The lambda DNA is cut with a restriction
endonuclease and the foreign DNA, e.g. the DNA to be cloned, is ligated to the phage DNA fragments. The resulting
recombinant molecule is then packaged into infective phage particles. Host cells are infected with the phage particles
containing the recombinant DNA. The phage DNA replicates in the host cell to produce many copies of the desired DNA
sequence.

Cosmids are hybrid plasmid/bacteriophage vectors which can be used to clone DNA fragments of about 40,000
base pairs. Cosmids are plasmids which have one or more DNA sequences called "cos" sites derived from bacteriophage
lambda for packaging lambda DNA into infective phage particles. Two cosmids are ligated to the DNA to be cloned. The
resulting molecule is packaged into infective lambda phage particles and transfected into bacteria host cells. When the
cosmids are inside the host cell they behave like plasmids and multiply under the control of a plasmid origin of replication.
The origin of replication is a sequence of DNA which allows a plasmid to multiply within a host cell.

Yeast artificial chromosome vectors are similar to plasmids but allow for the incorporation of much larger DNA
sequences of about 400,000 base pairs. The yeast artificial chromosomes contain sequences for replication in yeast.
The yeast artificial chromosome containing the DNA to be cloned is transformed into yeast cells where it replicates
thereby producing many copies of the desired DNA sequence. Where phage, cosmids, or yeast artificial chromosomes
are employed as cloning vectors, expression of the chimeric protein may be obtained by culturing host cells that have
been transfected or transformed with the cloning vector in a suitable culture medium.
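The approximate insert capacities quoted above (about 20,000 base pairs for bacteriophage lambda, about 40,000 for cosmids and about 400,000 for yeast artificial chromosomes) lend themselves to a simple lookup. The following is a minimal sketch only: the function name is hypothetical, the capacities are the round figures from the text treated as hard upper limits, and choosing the smallest sufficient vector is an illustrative rule rather than a recommendation of the disclosure.

```python
# Approximate upper insert sizes, in base pairs, as quoted in the text.
VECTOR_CAPACITY_BP = [
    ("bacteriophage lambda", 20_000),
    ("cosmid", 40_000),
    ("yeast artificial chromosome", 400_000),
]


def smallest_suitable_vector(insert_bp):
    """Return the first listed vector whose quoted capacity covers insert_bp,
    or None if the insert exceeds every quoted capacity."""
    if insert_bp <= 0:
        raise ValueError("insert length must be positive")
    for name, capacity in VECTOR_CAPACITY_BP:
        if insert_bp <= capacity:
            return name
    return None


for size in (3_535, 35_000, 250_000):
    print(size, "bp ->", smallest_suitable_vector(size))
```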

Chimeric proteins disclosed herein are intended for use in treating mammals or other animals. The therapeutically
active moieties described above, namely, osteogenic agents such as BMPs, TGFs, decorin, and/or fragments of each
of them, are all to be considered as being or having been derived from cellular regulatory factors for purposes of this
disclosure. The chimeric proteins and DNA constructs which incorporate a domain derived from one or more cellular
regulatory factors can be used for in vivo therapeutic treatment, in vitro research or for diagnostic purposes in general.

When used in vivo, formulations containing the inventive chimeric proteins may be placed in direct contact with
viable tissue, including bone, to induce or enhance growth, repair and/or replacement of such tissue. This may be
accomplished by applying a chimeric protein directly to a target site during surgery. It is contemplated that minimally invasive
techniques such as endoscopy are to be used to apply a chimeric protein to a desired location. Formulations containing
the chimeric proteins disclosed herein may consist solely of one or more chimeric proteins or may also incorporate one
or more pharmaceutically acceptable adjuvants.

In an alternate embodiment, any of the above-described chimeric proteins may be contacted with, adhered to, or
otherwise incorporated into an implant such as a drug delivery device or a prosthetic device. Chimeric proteins may be
microencapsulated or macroencapsulated by liposomes or other membrane forming materials such as alginic acid
derivatives prior to implantation and then implanted in the form of a pouchlike implant. The chimeric protein may be
microencapsulated in structures in the form of spheres, aggregates of core material embedded in a continuum of wall material
or capillary designs. Microencapsulation techniques are well known in the art and are described in the Encyclopedia of
Polymer Science and Engineering, Vol. 9, pp. 724 et seq. (1980), hereby incorporated by reference.

Chimeric proteins may also be coated on or incorporated into medically useful materials such as meshes, pads, felts,
dressings or prosthetic devices such as rods, pins, bone plates, artificial joints, artificial limbs or bone augmentation
implants. The implants may, in part, be made of biocompatible materials such as glass, metal, ceramic, calcium
phosphate or calcium carbonate based materials. Implants having biocompatible biomaterials are well known in the art and
are all suitable for use. Implant biomaterials derived from natural sources such as protein fibers, polysaccharides, and
treated naturally derived tissues are described in the Encyclopedia of Polymer Science and Engineering, Vol. 2, pp. 267
et seq. (1989), hereby incorporated by reference. Synthetic biocompatible polymers are well known in the art and are
also suitable implant materials. Examples of suitable synthetic polymers include urethanes, olefins, terephthalates,
acrylates, polyesters and the like. Other acceptable implant materials are biodegradable hydrogels or aggregations of closely
packed particles such as polymethylmethacrylate beads with a polymerized hydroxyethyl methacrylate coating. See the
Encyclopedia of Polymer Science and Engineering, Vol. 2, pp. 267 et seq. (1989), hereby incorporated by reference.

The chimeric protein provides a useful way for immobilizing or coating a cellular regulatory factor on a
pharmaceutically acceptable vehicle to deliver the cellular regulatory factor to desired sites in viable tissue. Suitable vehicles include
those made of bioabsorbable polymers, biocompatible nonabsorbable polymers, laetoner putty and plaster of Paris.
Examples of suitable bioabsorbable and biocompatible polymers include homopolymers, copolymers and blends of
hydroxyacids such as lactide and glycolide. Other absorbable polymers, which may be used alone or in combination with
hydroxyacids, include dioxanones, carbonates such as trimethylene carbonate, lactones such as caprolactone,
polyoxyalkylenes, and oxalates. See the Encyclopedia of Polymer Science and Engineering, Vol. 2, pp. 230 et seq. (1989),
hereby incorporated by reference.
These vehicles may be in the form of beads, particles, putty, coatings or film vehicles. Diffusional systems in which
a core of chimeric protein is surrounded by a porous membrane layer are other acceptable vehicles.

The following examples should be considered as illustrative of certain embodiments disclosed herein but should 
not be considered as limiting the inventive disclosure. 

EXAMPLE I

Cloning BMP-2B/collagen IA DNA segment constructs

Obtaining PCR products for BMP-2B and Collagen I(α): The chimeric gene encoding the BMP-2B/Collagen I(α) fusion
protein is assembled from PCR products. The PCR primers are designed to provide restriction sites on the 5' and 3'
ends that facilitate later ligation steps. The 5' and 3' ends of the BMP-2B PCR product contain BamHI and HindIII
restriction sites respectively. The 5' and 3' ends of the Collagen I(α) PCR product contain XmnI and BglII restriction sites
respectively. Amplification is carried out on template cDNA synthesized from total cellular RNA using standard methods.
PCR reactions for BMP-2B and Collagen I(α) use cDNA prepared from U-2OS and AG02261A cell lines respectively.
After amplification and purification, the PCR products are ligated into PCR II vectors. Positive clones are identified by
screening plasmids for the correct molecular weight. The clones are verified by DNA sequencing using standard methods.
The BMP-2B PCR product is excised from PCR II by restriction digestion with BamHI and HindIII and the Collagen I(α)
segment is excised from PCR II using XmnI and BglII. The restriction digest reactions are resolved by electrophoresis
through agarose gels and the DNA fragments with the BMP-2B and Collagen I(α) sequences are purified with Geneclean
(BIO 101).

EXAMPLE II 

Cloning TGF-β1/collagen IA DNA segment constructs

Obtaining PCR products for TGF-β1 and Collagen I(α): The chimeric gene encoding the TGF-β1/Collagen I(α) fusion
protein is assembled from PCR products. The PCR primers are designed to provide restriction sites on the 5' and 3'
ends that facilitate later ligation steps. The 5' and 3' ends of the TGF-β1 PCR product contain BglII and XbaI restriction
sites respectively. The 5' and 3' ends of the Collagen I(α) PCR product contain XmnI and BglII restriction sites respectively.
Amplification is carried out on template cDNA synthesized from total cellular RNA using standard methods. PCR
reactions for TGF-β1 and Collagen I(α) use cDNA prepared from AG02261A cells. After amplification and purification, the
PCR products are ligated into PCR II vectors. Positive clones are identified by screening plasmids for the correct
molecular weight. The clones are verified by DNA sequencing using standard methods. The TGF-β1 PCR product is excised
from PCR II by restriction digestion with BglII and XbaI and the Collagen I(α) segment is excised from PCR II using
XmnI and BglII. The restriction digest reactions are resolved by electrophoresis through agarose gels and the DNA
fragments with the TGF-β1 and Collagen I(α) sequences are purified with Geneclean (BIO 101).

EXAMPLE III

Cloning dermatan sulfate proteoglycan (decorin)/collagen IA DNA segment constructs

Obtaining PCR products for Decorin and Collagen I(α): The chimeric gene encoding the Decorin/Collagen I(α) fusion
protein is assembled from PCR products. The PCR primers are designed to provide restriction sites on the 5' and 3'
ends that facilitate later ligation steps. The 5' and 3' ends of the Decorin PCR product contain BamHI and PstI restriction
sites respectively. The 5' and 3' ends of the Collagen I(α) PCR product contain XmnI and BglII restriction sites respectively.
Amplification is carried out on template cDNA synthesized from total cellular RNA using standard methods. PCR
reactions for Decorin and Collagen I(α) use cDNA prepared from AG02261A cells. After amplification and purification, the
respective PCR products are ligated into respective PCR II vectors. Positive clones are identified by screening plasmids
for the correct molecular weight. The clones are verified by DNA sequencing using standard methods. The Decorin PCR
product is excised from PCR II by restriction digestion with BamHI and PstI and the Collagen I(α) segment is excised
from PCR II using XmnI and BglII. The restriction digest reactions are resolved by electrophoresis through agarose gels
and the DNA fragments with the Decorin and Collagen I(α) sequences are purified with Geneclean (BIO 101).

EXAMPLE IV

Construction of cloning vector incorporating DNA constructs of Example I

Ligation of BMP-2B and Collagen I(α) segments into the pMal-c2 expression vector: The pMal-c2 vector is treated
with BamHI and HindIII, resolved by electrophoresis through an agarose gel and purified by standard methods. The
BMP-2B segment with matching BamHI and HindIII restriction sites on the 5' and 3' ends is ligated into pMal-c2 and
transformants are screened for the insert by standard techniques. Positive clones are verified by DNA sequencing and
designated pMal-c2-BMP. To complete the construction, pMal-c2-BMP is digested with XmnI and BamHI and the Collagen
I(α) segment which is digested with XmnI and BglII is ligated into those sites by standard methods (BamHI and BglII
produce compatible termini). Positive clones are verified by DNA sequencing and designated pMal-CB. See Fig. 11.
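The remark that BamHI and BglII produce compatible termini can be verified from the recognition sequences. The sketch below is illustrative only and is not drawn from the disclosure: the recognition sites and cut positions are the standard published ones for these enzymes, the function names are hypothetical, and the model assumes palindromic six-base sites cut symmetrically on both strands. It derives the single-stranded overhang each enzyme leaves and the sequence formed when a BglII-cut end is ligated to a BamHI-cut end.

```python
# Recognition site (top strand, 5'->3') and number of bases before the
# top-strand cut. Standard values; all sites here are palindromic 6-mers.
ENZYMES = {
    "EcoRI": ("GAATTC", 1),
    "BamHI": ("GGATCC", 1),
    "BglII": ("AGATCT", 1),
    "HindIII": ("AAGCTT", 1),
    "XbaI": ("TCTAGA", 1),
    "SalI": ("GTCGAC", 1),
    "PstI": ("CTGCAG", 5),
}


def overhang(enzyme):
    """Return (kind, bases) of the single-stranded end left by the enzyme."""
    site, cut = ENZYMES[enzyme]
    mirror = len(site) - cut  # bottom-strand cut, in top-strand coordinates
    if cut == mirror:
        return ("blunt", "")
    kind = "5'" if cut < mirror else "3'"
    return (kind, site[min(cut, mirror):max(cut, mirror)])


def compatible(a, b):
    """True if ends cut by a and b carry the same overhang and can anneal."""
    return overhang(a) == overhang(b)


def junction(left, right):
    """Top-strand sequence formed where a `left`-cut end joins a `right`-cut end."""
    if not compatible(left, right):
        raise ValueError(f"{left} and {right} ends are not compatible")
    left_site, left_cut = ENZYMES[left]
    right_site, right_cut = ENZYMES[right]
    return left_site[:left_cut] + right_site[right_cut:]


print(overhang("BamHI"), overhang("BglII"))
print("BglII/BamHI junction:", junction("BglII", "BamHI"))
```

Both enzymes leave a 5' GATC overhang. The hybrid junction, AGATCC, matches neither original recognition site, so under this model a BglII/BamHI junction would not be recut by either enzyme.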

EXAMPLE V 

Construction of cloning vector incorporating DNA constructs of Example II

Ligation of TGF-β1 and Collagen I(α) segments into the pMal-c2 expression vector: The pMal-c2 vector is treated
with XmnI and XbaI, resolved by electrophoresis through an agarose gel and purified by standard methods. The Collagen
I(α) segment with a 5' XmnI site and a 3' BglII restriction site and the TGF-β1 segment with a 5' BglII site and a 3' XbaI
site are combined with the digested and purified pMal-c2 plasmid for a three fragment ligation reaction using standard
methods. Transformants are screened for the insert by standard techniques. Positive clones are verified by DNA
sequencing and designated pMal-CT. See Fig. 12.

EXAMPLE VI

Construction of cloning vector incorporating DNA constructs of Example III

Ligation of Decorin and Collagen I(α) segments into the pMal-c2 expression vector: The pMal-c2 vector is treated
with XmnI and PstI, resolved by electrophoresis through an agarose gel and purified by standard methods. The Collagen
I(α) segment with a 5' XmnI site and a 3' BglII restriction site and the Decorin segment with a 5' BamHI site and a 3' PstI
site are combined with the digested and purified pMal-c2 plasmid for a three fragment ligation reaction using standard
methods (BamHI and BglII produce compatible termini). Transformants are screened for the insert by standard
techniques. Positive clones are verified by DNA sequencing and designated pMal-CD. See Fig. 13.

EXAMPLE VII 

Transformation of E. coli and Expression of Collagen/TGF-β1 and Collagen/Decorin Chimeric Genes in E. coli

Expression plasmids pMal-CB (Collagen-BMP2B Chimera), pMal-CT (Collagen-TGF-β1 Chimera) and pMal-CD
(Collagen-Decorin Chimera) are used to transform E. coli HB101 using standard techniques. To express protein, a 50
ml culture of E. coli harboring one of the expression vectors is inoculated into 1 L of LB broth and incubated with agitation
at 37°C. When the A600 is 0.5±0.1, 0.1 M IPTG is added to a final concentration of 1.5-15 mM. The culture is maintained
at 37°C until the A600 is 1.3 to 1.8 and the E. coli is harvested by centrifugation at 4000xg. The cell pellets are resuspended
in 7.5 ml 20 mM Tris HCl pH 7.5, 200 mM NaCl, 1 mM EDTA (hereinafter "column buffer") and frozen in a dry ice/ethanol
bath. The frozen cell pellets are thawed at 4°C, then sonicated on ice until the cells are disrupted. Cell debris is removed
by centrifugation at 9,000xg at 4°C for 30 minutes. The supernatant fraction contains the E. coli crude cell lysate which
is analyzed for protein production by SDS-PAGE. The recombinant protein product produced from these pMal vectors
is a fusion protein with MBP (maltose binding protein). The MBP segment is included to allow a single step purification
of the protein.
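The volume of 0.1 M IPTG stock needed to reach a given final concentration follows from conservation of solute. The sketch below is a worked calculation under stated assumptions, not a protocol from the disclosure: the function name is hypothetical, the culture volume of 1050 ml assumes the 50 ml inoculum simply adds to the 1 L of broth, and the added stock volume is counted in the final volume. Setting stock × v = final × (culture + v) gives v = final × culture / (stock − final).

```python
def stock_volume_ml(stock_mM, final_mM, culture_ml):
    """Volume of stock (ml) to add so that the mixture reaches final_mM.

    Solves stock_mM * v = final_mM * (culture_ml + v) for v, i.e. the
    dilution caused by the added stock itself is taken into account.
    """
    if not 0 < final_mM < stock_mM:
        raise ValueError("final concentration must be positive and below the stock concentration")
    if culture_ml <= 0:
        raise ValueError("culture volume must be positive")
    return final_mM * culture_ml / (stock_mM - final_mM)


STOCK_MM = 100.0     # 0.1 M IPTG stock, as stated in the text
CULTURE_ML = 1050.0  # assumption: 1 L of LB broth plus the 50 ml inoculum

for final_mM in (1.5, 15.0):  # the two ends of the stated range
    v = stock_volume_ml(STOCK_MM, final_mM, CULTURE_ML)
    print(f"{final_mM} mM final: add {v:.1f} ml of stock")
```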

The crude lysate is passed over an amylose column containing ml of resin/3 mg of recombinant protein (expected 
yield). The column is washed with 8 volumes of column buffer and the column flow through is reapplied to the column. 
Another 8 volumes of column buffer is used to wash the column. The fusion protein is eluted from the column using 

column buffer containing 10 mM maltose. Fractions containing the recombinant chimeric protein are identified by the
BCA protein assay (Pierce) and verified by SDS-PAGE. The fractions that contain the protein are pooled.

The MBP segment of the purified protein is cleaved from the collagen-growth factor chimera by treatment with factor
Xa (New England Biolabs) at room temperature for 24 hours. The collagen-growth factor chimera is separated from the
MBP segment by chromatography through an amylose column. The column flow through contains the collagen-growth
factor chimera, which is analyzed by SDS-PAGE. Typical yields of purified protein range from 10-50 mg/liter of E. coli
culture.

EXAMPLE VIII 

Expression of Collagen-Growth Factor Chimeric Genes in Sf9 Cells

A useful alternative to the E. coli expression system is Baculovirus. The gene for the collagen-growth factor chimeras
is modified to include an ATG start codon at the 5' end and a TAA stop codon at the 3' end. The transcriptional unit is
ligated into the baculoviral transfer vector pBlueBac III (Invitrogen). The resulting transfer vector is verified by DNA
sequencing. The collagen-growth factor chimera gene is transferred into the baculovirus genome (AcMNPV) by the
standard in vivo recombination method. The pBlueBac III transfer vector containing the collagen-growth factor chimera
gene is cotransfected into Sf9 cells by standard methods. Recombinant viral plaques that are blue are selected and
isolated by several rounds of reinfection. Pure recombinant baculovirus is verified by DNA sequencing. The recombinant
virus containing the collagen-growth factor chimera gene is used to infect suspension cultures of Sf9 cells and optimal
protein expression is determined at 48-72 hours post-infection. The protein product is recovered from the culture medium
and analyzed by SDS-PAGE.

It will be understood that various modifications may be made to the embodiments disclosed herein. Therefore, the above 
description should not be construed as limiting, but merely as exemplifications of preferred embodiments. Those skilled 
in the art will envision other modifications within the scope and spirit of the claims appended hereto. 

The claims which follow identify embodiments of the invention additional to those described in detail above.



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: United States Surgical Corporation 

(B) STREET: 150 Glover Avenue 

(C) CITY: Norwalk 

(D) STATE: Connecticut 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 06856 

(ii) TITLE OF INVENTION: Recombinant chimeric proteins and methods of 

use thereof 

(iii) NUMBER OF SEQUENCES: 8 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 95109019.0 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 20..3526



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



GGGAAGGATT TCCATTTCCC AGCTGTCTTA TGGCTATGAT GAGAAATCAA CCGGAGGAAT     60
TTCCGTGCCT GGCCCCATGG GTCCCTCTGG TCCTCGTGGT CTCCCTGGCC CCCCTGGTGC    120
ACCTGGTCCC CAAGGCTTCC AAGGTCCCCC TGGTGAGCCT GGCGAGCCTG GAGCTTCAGG    180
TCCCATGGGT CCCCGAGGTC CCCCAGGTCC CCCTGGAAAG AATGGAGATG ATGGGGAAGC    240
TGGAAAACCT GGTCGTCCTG GTGAGCGTGG GCCTCCTGGG CCTCAGGGTG CTCGAGGATT    300
GCCCGGAACA GCTGGCCTCC CTGGAATGAA GGGACACAGA GGTTTCAGTG GTTTGGATGG    360
TGCCAAGGGA GATGCTGGTC CTGCTGGTCC TAAGGGTGAG CCTGGCAGCC CTGGTGAAAA    420
TGGAGCTCCT GGTCAGATGG GCCCCCGTGG CCTGCCTGGT GAGAGAGGTC GCCCTGGAGC    480
CCCTGGCCCT GCTGGTGCTC GTGGAAATGA TGGTGCTACT GGTGCTGCCG GGCCCCCTGG    540
TCCCACCGGC CCCGCTGGTC CTCCTGGCTT CCCTGGTGCT GTTGGTGCTA AGGGTGAAGC    600
TGGTCCCCAA GGGCCCCGAG GCTCTGAAGG TCCCCAGGGT GTGCGTGGTG AGCCTGGCCC    660
CCCTGGCCCT GCTGGTGCTG CTGGCCCTGC TGGAAACCCT GGTGCTGATG GACAGCCTGG    720
TGCTAAAGGT GCCAATGGTG CTCCTGGTAT TGCTGGTGCT CCTGGCTTCC CTGGTGCCCG    780
AGGCCCCTCT GGACCCCAGG GCCCCGGCGG CCCTCCTGGT CCCAAGGGTA ACAGCGGTGA    840
ACCTGGTGCT CCTGGCAGCA AAGGAGACAC TGGTGCTAAG GGAGAGCCTG GCCCTGTTGG    900
TGTTCAAGGA CCCCCTGGCC CTGCTGGAGA GGAAGGAAAG CGAGGAGCTC GAGGTGAACC    960
CGGACCCACT GGCCTGCCCG GACCCCCTGG CGAGCGTGGT GGACCTGGTA GCCGTGGTTT   1020
CCCTGGCGCA GATGGTGTTG CTGGTCCCAA GGGTCCCGCT GGTGAACGTG GTTCTCCTGG   1080
CCCCGCTGGC CCGAAAGGAT CTCCTGGTGA AGCTGGTCGT CCCGGTGAAG CTGGTCTGCC   1140
TGGTGCCAAG GGTCTGACTG GAAGCCCTGG CAGCCCTGGT CCTGATGGCA AAACTGGCCC   1200
CCCTGGTCCC GCCGGTCAAG ATGGTCGCCC CGGACCCCCA GGCCCACCTG GTGCCCGTGG   1260
TCAGGCTGGT GTGATGGGAT TCCCTGGACC TAAAGGTGCT GCTGGAGAGC CCGGCAAGGC   1320
TGGAGAGCGA GGTGTTCCCG GACCCCCTGG CGCTGTCGGT CCTGCTGGCA AAGATGGAGA   1380
GGCTGGAGCT CAGGGACCCC CTGGCCCTGC TGGTCCCGCT GGCGAGAGAG GTGAACAAGG   1440
CCCTGCTGGC TCCCCCGGAT TCCAGGGTCT CCCTGGTCCT GCTGGTCCTC CAGGTGAAGC   1500
AGGCAAACCT GGTGAACAGG GTGTTCCTGG AGACCTTGGC GCCCCTGGCC CCTCTGGAGC   1560
AAGAGGCGAG AGAGGTTTCC CTGGCGAGCG TGGTGTGCAA GGTCCCCCTG GTCCTGCTGG   1620
ACCCCGAGGG GCCAACGGTG CTCCCGGCAA CGATGGTGCT AAGGGTGATG CTGGTGCCCC   1680
TGGAGCTCCC GGTAGCCAGG GCGCCCCTGG CCTTCAGGGA ATGCCTGGTG AACGTGGTGC   1740
AGCTGGTCTT CCAGGGCCTA AGGGTGACAG AGGTGATGCT GGTCCCAAAG GTGCTGATGG   1800
CTCTCCTGGC AAAGATGGCG TCCGTGGTCT GACCGGCCCC ATTGGTCCTC CTGGCCCTGC   1860
TGGTGCCCCT GGTGACAAGG GTGAAAGTGG TCCCAGCGGC CCTGCTGGTC CCACTGGAGC   1920
TCGTGGTGCC CCCGGAGACC GTGGTGAGCC TGGTCCCCCC GGCCCTGCTG GCTTTGCTGG   1980
CCCCCCTGGT GCTGACGGCC AACCTGGTGC TAAAGGCGAA CCTGGTGATG CTGGTGCCAA   2040
AGGCGATGCT GGTCCCCCTG GGCCTGCCGG ACCCGCTGGA CCCCCTGGCC CCATTGGTAA   2100
TGTTGGTGCT CCTGGAGCCA AAGGTGCTCG CGGCAGCGCT GGTCCCCCTG GTGCTACTGG   2160
TTTCCCTGGT GCTGCTGGCC GAGTCGGTCC TCCTGGCCCC TCTGGAAATG CTGGACCCCC   2220
TGGCCCTCCT GGTCCTGCTG GCAAAGAAGG CGGCAAAGGT CCCCGTGGTG AGACTGGCCC   2280
TGCTGGACGT CCTGGTGAAG TTGGTCCCCC TGGTCCCCCT GGCCCTGCTG GCGAGAAAGG   2340
ATCCCCTGGT GCTGATGGTC CTGCTGGTGC TCCTGGTACT CCCGGGCCTC AAGGTATTGC   2400
TGGACAGCGT GGTGTGGTCG GCCTGCCTGG TCAGAGAGGA GAGAGAGGCT TCCCTGGTCT   2460
TCCTGGCCCC TCTGGTGAAC CTGGCAAACA AGGTCCCTCT GGAGCAAGTG GTGAACGTGG   2520
TCCCCCCGGT CCCATGGGCC CCCCTGGATT GGCTGGACCC CCTGGTGAAT CTGGACGTGA   2580
GGGGGCTCCT GCTGCCGAAG GTTCCCCTGG ACGAGACGGT TCTCCTGGCG CCAAGGGTGA   2640


CCGTGGTGAG ACCGGCCCCG CTGGACCCCC TGGTGCTCNT GGTGCTCNTG GTGCCCCTGG 2700

CCCCGTTGGC CCTGCTGGCA AGAGTGGTGA TCGTGGTGAG ACTGGTCCTG CTGGTCCCGC 2760

CGGTCCCGTC GGCCCCGCTG GCGCCCGTGG CCCCGCCGGA CCCCAAGGCC CCCGTGGTGA 2820

CAAGGGTGAG ACAGGCGAAC AGGGCGACAG AGGCATAAAG GGTCACCGTG GCTTCTCTGG 2880

CCTCCAGGGT CCCCCTGGCC CTCCTGGCTC TCCTGGTGAA CAAGGTCCCT CTGGAGCCTC 2940

TGGTCCTGCT GGTCCCCGAG GTCCCCCTGG CTCTGCTGGT GCTCCTGGCA AAGATGGACT 3000

CAACGGTCTC CCTGGCCCCA TTGGGCCCCC TGGTCCTCGC GGTCGCACTG GTGATGCTGG 3060

TCCTGTTGGT CCCCCCGGCC CTCCTGGACC TCCTGGTCCC CCTGGTCCTC CCAGCGCTGG 3120


TTTCGACTTC AGCTTCCTCC CCCAGCCACC TCAAGAGAAG GCTCACGATG GTGGCCGCTA 3180

CTACCGGGCT AGATCCCAGC GGGCCAGGAA GAAGAATAAG AACTGCCGGC GCCACTCGCT 3240

CTATGTGGAC TTCAGCGATG TGGGCTGGAA TGACTGGATT GTGGCCCCAC CAGGCTACCA 3300

GGCCTTCTAC TGCCATGGGG ACTGCCCCTT TCCACTGGCT GACCACCTCA ACTCAACCAA 3360

CCATGCCATT GTGCAGACCC TGGTCAATTC TGTCAATTCC AGTATCCCCA AAGCCTGTTG 3420

TGTGCCCACT GAACTGAGTG CCATCTCCAT GCTGTACCTG GATGAGTATG ATAAGGTGGT 3480

ACTGAAAAAT TATCAGGAGA TGGTAGTAGA GGGATGTGGG TGCCGCTAAA AGCTT 3535



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3541 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 20..3532

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

GGGAAGGATT TCCATTTCCC AGCTGTCTTA TGGCTATGAT GAGAAATCAA CCGGAGGAAT 60

TTCCGTGCCT GGCCCCATGG GTCCCTCTGG TCCTCGTGGT CTCCCTGGCC CCCCTGGTGC 120

ACCTGGTCCC CAAGGCTTCC AAGGTCCCCC TGGTGAGCCT GGCGAGCCTG GAGCTTCAGG 180

TCCCATGGGT CCCCGAGGTC CCCCAGGTCC CCCTGGAAAG AATGGAGATG ATGGGGAAGC 240

TGGAAAACCT GGTCGTCCTG GTGAGCGTGG GCCTCCTGGG CCTCAGGGTG CTCGAGGATT 300

GCCCGGAACA GCTGGCCTCC CTGGAATGAA GGGACACAGA GGTTTCAGTG GTTTGGATGG 360

TGCCAAGGGA GATGCTGGTC CTGCTGGTCC TAAGGGTGAG CCTGGCAGCC CTGGTGAAAA 420

TGGAGCTCCT GGTCAGATGG GCCCCCGTGG CCTGCCTGGT GAGAGAGGTC GCCCTGGAGC 480

CCCTGGCCCT GCTGGTGCTC GTGGAAATGA TGGTGCTACT GGTGCTGCCG GGCCCCCTGG 540

TCCCACCGGC CCCGCTGGTC CTCCTGGCTT CCCTGGTGCT GTTGGTGCTA AGGGTGAAGC 600

TGGTCCCCAA GGGCCCCGAG GCTCTGAAGG TCCCCAGGGT GTGCGTGGTG AGCCTGGCCC 660

CCCTGGCCCT GCTGGTGCTG CTGGCCCTGC TGGAAACCCT GGTGCTGATG GACAGCCTGG 720

TGCTAAAGGT GCCAATGGTG CTCCTGGTAT TGCTGGTGCT CCTGGCTTCC CTGGTGCCCG 780

AGGCCCCTCT GGACCCCAGG GCCCCGGCGG CCCTCCTGGT CCCAAGGGTA ACAGCGGTGA 840

ACCTGGTGCT CCTGGCAGCA AAGGAGACAC TGGTGCTAAG GGAGAGCCTG GCCCTGTTGG 900

TGTTCAAGGA CCCCCTGGCC CTGCTGGAGA GGAAGGAAAG CGAGGAGCTC GAGGTGAACC 960

CGGACCCACT GGCCTGCCCG GACCCCCTGG CGAGCGTGGT GGACCTGGTA GCCGTGGTTT 1020

CCCTGGCGCA GATGGTGTTG CTGGTCCCAA GGGTCCCGCT GGTGAACGTG GTTCTCCTGG 1080

CCCCGCTGGC CCCAAAGGAT CTCCTGGTGA AGCTGGTCGT CCCGGTGAAG CTGGTCTGCC 1140

TGGTGCCAAG GGTCTGACTG GAAGCCCTGG CAGCCCTGGT CCTGATGGCA AAACTGGCCC 1200

CCCTGGTCCC GCCGGTCAAG ATGGTCGCCC CGGACCCCCA GGCCCACCTG GTGCCCGTGG 1260

TCAGGCTGGT GTGATGGGAT TCCCTGGACC TAAAGGTGCT GCTGGAGAGC CCGGCAAGGC 1320

TGGAGAGCGA GGTGTTCCCG GACCCCCTGG CGCTGTCGGT CCTGCTGGCA AAGATGGAGA 1380

GGCTGGAGCT CAGGGACCCC CTGGCCCTGC TGGTCCCGCT GGCGAGAGAG GTGAACAAGG 1440

CCCTGCTGGC TCCCCCGGAT TCCAGGGTCT CCCTGGTCCT GCTGGTCCTC CAGGTGAAGC 1500

AGGCAAACCT GGTGAACAGG GTGTTCCTGG AGACCTTGGC GCCCCTGGCC CCTCTGGAGC 1560

AAGAGGCGAG AGAGGTTTCC CTGGCGAGCG TGGTGTGCAA GGTCCCCCTG GTCCTGCTGG 1620

ACCCCGAGGG GCCAACGGTG CTCCCGGCAA CGATGGTGCT AAGGGTGATG CTGGTGCCCC 1680

TGGAGCTCCC GGTAGCCAGG GCGCCCCTGG CCTTCAGGGA ATGCCTGGTG AACGTGGTGC 1740

AGCTGGTCTT CCAGGGCCTA AGGGTGACAG AGGTGATGCT GGTCCCAAAG GTGCTGATGG 1800

CTCTCCTGGC AAAGATGGCG TCCGTGGTCT GACCGGCCCC ATTGGTCCTC CTGGCCCTGC 1860

TGGTGCCCCT GGTGACAAGG GTGAAAGTGG TCCCAGCGGC CCTGCTGGTC CCACTGGAGC 1920

TCGTGGTGCC CCCGGAGACC GTGGTGAGCC TGGTCCCCCC GGCCCTGCTG GCTTTGCTGG 1980

CCCCCCTGGT GCTGACGGCC AACCTGGTGC TAAAGGCGAA CCTGGTGATG CTGGTGCCAA 2040

AGGCGATGCT GGTCCCCCTG GGCCTGCCGG ACCCGCTGGA CCCCCTGGCC CCATTGGTAA 2100

TGTTGGTGCT CCTGGAGCCA AAGGTGCTCG CGGCAGCGCT GGTCCCCCTG GTGCTACTGG 2160

TTTCCCTGGT GCTGCTGGCC GAGTCGGTCC TCCTGGCCCC TCTGGAAATG CTGGACCCCC 2220

TGGCCCTCCT GGTCCTGCTG GCAAAGAAGG CGGCAAAGGT CCCCGTGGTG AGACTGGCCC 2280

TGCTGGACGT CCTGGTGAAG TTGGTCCCCC TGGTCCCCCT GGCCCTGCTG GCGAGAAAGG 2340

ATCCCCTGGT GCTGATGGTC CTGCTGGTGC TCCTGGTACT CCCGGGCCTC AAGGTATTGC 2400

TGGACAGCGT GGTGTGGTCG GCCTGCCTGG TCAGAGAGGA GAGAGAGGCT TCCCTGGTCT 2460

TCCTGGCCCC TCTGGTGAAC CTGGCAAACA AGGTCCCTCT GGAGCAAGTG GTGAACGTGG 2520

TCCCCCCGGT CCCATGGGCC CCCCTGGATT GGCTGGACCC CCTGGTGAAT CTGGACGTGA 2580

GGGGGCTCCT GCTGCCGAAG GTTCCCCTGG ACGAGACGGT TCTCCTGGCG CCAAGGGTGA 2640

CCGTGGTGAG ACCGGCCCCG CTGGACCCCC TGGTGCTCNT GGTGCTCNTG GTGCCCCTGG 2700

CCCCGTTGGC CCTGCTGGCA AGAGTGGTGA TCGTGGTGAG ACTGGTCCTG CTGGTCCCGC 2760

CGGTCCCGTC GGCCCCGCTG GCGCCCGTGG CCCCGCCGGA CCCCAAGGCC CCCGTGGTGA 2820

CAAGGGTGAG ACAGGCGAAC AGGGCGACAG AGGCATAAAG GGTCACCGTG GCTTCTCTGG 2880

CCTCCAGGGT CCCCCTGGCC CTCCTGGCTC TCCTGGTGAA CAAGGTCCCT CTGGAGCCTC 2940

TGGTCCTGCT GGTCCCCGAG GTCCCCCTGG CTCTGCTGGT GCTCCTGGCA AAGATGGACT 3000

CAACGGTCTC CCTGGCCCCA TTGGGCCCCC TGGTCCTCGC GGTCGCACTG GTGATGCTGG 3060

TCCTGTTGGT CCCCCCGGCC CTCCTGGACC TCCTGGTCCC CCTGGTCCTC CCAGCGCTGG 3120

TTTCGACTTC AGCTTCCTCC CCCAGCCACC TCAAGAGAAG GCTCACGATG GTGGCCGCTA 3180

CTACCGGGCT AGATCTGCCC TGGACACCAA CTATTGCTTC AGCTCCACGG AGAAGAACTG 3240

CTGCGTGCGG CAGCTGTACA TTGACTTCCG CAAGGACCTC GGCTGGAAGT GGATCCACGA 3300

GCCCAAGGGC TACCATGCCA ACTTCTGCCT CGGGCCCTGC CCCTACATTT GGAGCCTGGA 3360

CACGCAGTAC AGCAAGGTCC TGGCCCTGTA CAACCAGCAT AACCCGGGCG CCTCGGCGGC 3420

GCCGTGCTGC GTGCCGCAGG CGCTGGAGCC GCTGCCCATC GTGTACTACG TGGGCCGCAA 3480

GCCCAAGGTG GAGCAGCTGT CCAACATGAT CGTGCGCTCC TGCAAGTGCA GCTGATCTAG 3540

A 3541
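
As a reading aid, the coding region annotated for SEQ ID NO: 2 (CDS beginning at position 20) can be translated with the standard genetic code. The following is a minimal sketch and not part of the original listing; the helper name `translate`, the one-letter output, and the choice to report codons containing an ambiguous base as `X` are illustrative assumptions. It translates only positions 1 to 120 of SEQ ID NO: 2 as printed.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, cds_start: int) -> str:
    """Translate from the 1-based position cds_start to the end of dna.

    Whitespace is ignored. A codon containing a base outside T, C, A, G
    (for example N) is reported as X; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    coding = seq[cds_start - 1:]
    usable = len(coding) - len(coding) % 3
    return "".join(
        CODON_TABLE.get(coding[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Positions 1-120 of SEQ ID NO: 2, copied from the listing.
FIRST_ROWS = """
GGGAAGGATT TCCATTTCCC AGCTGTCTTA TGGCTATGAT GAGAAATCAA CCGGAGGAAT
TTCCGTGCCT GGCCCCATGG GTCCCTCTGG TCCTCGTGGT CTCCCTGGCC CCCCTGGTGC
"""

print(translate(FIRST_ROWS, 20))
```

The output begins QLSYGYDEKSTGGISV, that is, Gln Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly Ile Ser Val, which agrees with the opening row of the amino acid listings given later in this sequence listing.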
(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4192 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 20..4183

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GGGAAGGATT TCCATTTCCC AGCTGTCTTA TGGCTATGAT GAGAAATCAA CCGGAGGAAT 60 
TTCCGTGCCT GGCCCCATGG GTCCCTCTGG TCCTCGTGGT CTCCCTGGCC CCCCTGGTGC 120 

ACCTGGTCCC CAAGGCTTCC AAGGTCCCCC TGGTGAGCCT GGCGAGCCTG GAGCTTCAGG 180

TCCCATGGGT CCCCGAGGTC CCCCAGGTCC CCCTGGAAAG AATGGAGATG ATGGGGAAGC 240 

TGGAAAACCT GGTCGTCCTG GTGAGCGTGG GCCTCCTGGG CCTGAGGGTG CTCGAGGATT 300 

GCCCGGAACA GCTGGCCTCC CTGGAATGAA GGGACACAGA GGTTTCAGTG GTTTGGATGG 360 

TGCCAAGGGA GATGCTGGTC CTGCTGGTCC TAAGGGTGAG CCTGGCAGCC CTGGTGAAAA 420 

TGGAGCTCCT GGTCAGATGG GCCCCCGTGG CCTGCCTGGT GAGAGAGGTC GCCCTGGAGC 480

CCCTGGCCCT GCTGGTGCTC GTGGAAATGA TGGTGCTACT GGTGCTGCCG GGCCCCCTGG 540

TCCCACCGGC CCCGCTGGTC CTCCTGGCTT CCCTGGTGCT GTTGGTGCTA AGGGTGAAGC 600

TGGTCCCCAA GGGCCCCGAG GCTCTGAAGG TCCCCAGGGT GTGCGTGGTG AGCCTGGCCC 660 

CCCTGGCCCT GCTGGTGCTG CTGGCCCTGC TGGAAACCCT GGTGCTGATG GACAGCCTGG 720 

TGCTAAAGGT GCCAATGGTG CTCCTGGTAT TGCTGGTGCT CCTGGCTTCC CTGGTGCCCG 780 

AGGCCCCTCT GGACCCCAGG GCCCCGGCGG CCCTCCTGGT CCCAAGGGTA ACAGCGGTGA 840

ACCTGGTGCT CCTGGCAGCA AAGGAGACAC TGGTGCTAAG GGAGAGCCTG GCCCTGTTGG 900

TGTTCAAGGA CCCCCTGGCC CTGCTGGAGA GGAAGGAAAG CGAGGAGCTC GAGGTGAACC 960

CGGACCCACT GGCCTGCCCG GACCCCCTGG CGAGCGTGGT GGACCTGGTA GCCGTGGTTT 1020

CCCTGGCGCA GATGGTGTTG CTGGTCCCAA GGGTCCCGCT GGTGAACGTG GTTCTCCTGG 1080 

CCCCGCTGGC CCCAAAGGAT CTCCTGGTGA AGCTGGTCGT CCCGGTGAAG CTGGTCTGCC 1140 

TGGTGCCAAG GGTCTGACTG GAAGCCCTGG CAGCCCTGGT CCTGATGGCA AAACTGGCCC 1200

CCCTGGTCCC GCCGGTCAAG ATGGTCGCCC CGGACCCCCA GGCCCACCTG GTGCCCGTGG 1260

TCAGGCTGGT GTGATGGGAT TCCCTGGACC TAAAGGTGCT GCTGGAGAGC CCGGCAAGGC 1320

TGGAGAGCGA GGTGTTCCCG GACCCCCTGG CGCTGTCGGT CCTGCTGGCA AAGATGGAGA 1380 

GGCTGGAGCT CAGGGACCCC CTGGCCCTGC TGGTCCCGCT GGCGAGAGAG GTGAACAAGG 1440 

CCCTGCTGGC TCCCCCGGAT TCCAGGGTCT CCCTGGTCCT GCTGGTCCTC CAGGTGAAGC 1500

AGGCAAACCT GGTGAACAGG GTGTTCCTGG AGACCTTGGC GCCCCTGGCC CCTCTGGAGC 1560

AAGAGGCGAG AGAGGTTTCC CTGGCGAGCG TGGTGTGCAA GGTCCCCCTG GTCCTGCTGG 1620

ACCCCGAGGG GCCAACGGTG CTCCCGGCAA CGATGGTGCT AAGGGTGATG CTGGTGCCCC 1680

TGGAGCTCCC GGTAGCCAGG GCGCCCCTGG CCTTCAGGGA ATGCCTGGTG AACGTGGTGC 1740

AGCTGGTCTT CCAGGGCCTA AGGGTGACAG AGGTGATGCT GGTCCCAAAG GTGCTGATGG 1800

CTCTCCTGGC AAAGATGGCG TCCGTGGTCT GACCGGCCCC ATTGGTCCTC CTGGCCCTGC 1860

TGGTGCCCCT GGTGACAAGG GTGAAAGTGG TCCCAGCGGC CCTGCTGGTC CCACTGGAGC 1920

TCGTGGTGCC CCCGGAGACC GTGGTGAGCC TGGTCCCCCC GGCCCTGCTG GCTTTGCTGG 1980

CCCCCCTGGT GCTGACGGCC AACCTGGTGC TAAAGGCGAA CCTGGTGATG CTGGTGCCAA 2040

AGGCGATGCT GGTCCCCCTG GGCCTGCCGG ACCCGCTGGA CCCCCTGGCC CCATTGGTAA 2100

TGTTGGTGCT CCTGGAGCCA AAGGTGCTCG CGGCAGCGCT GGTCCCCCTG GTGCTACTGG 2160

TTTCCCTGGT GCTGCTGGCC GAGTCGGTCC TCCTGGCCCC TCTGGAAATG CTGGACCCCC 2220 

TGGCCCTCCT GGTCCTGCTG GCAAAGAAGG CGGCAAAGGT CCCCGTGGTG AGACTGGCCC 2280 

TGCTGGACGT CCTGGTGAAG TTGGTCCCCC TGGTCCCCCT GGCCCTGCTG GCGAGAAAGG 2340 

ATCCCCTGGT GCTGATGGTC CTGCTGGTGC TCCTGGTACT CCCGGGCCTC AAGGTATTGC 2400 

TGGACAGCGT GGTGTGGTCG GCCTGCCTGG TCAGAGAGGA GAGAGAGGCT TCCCTGGTCT 2460 

TCCTGGCCCC TCTGGTGAAC CTGGCAAACA AGGTCCCTCT GGAGCAAGTG GTGAACGTGG 2520 

TCCCCCCGGT CCCATGGGCC CCCCTGGATT GGCTGGACCC CCTGGTGAAT CTGGACGTGA 2580

GGGGGCTCCT GCTGCCGAAG GTTCCCCTGG ACGAGACGGT TCTCCTGGCG CCAAGGGTGA 2640

CCGTGGTGAG ACCGGCCCCG CTGGACCCCC TGGTGCTCNT GGTGCTCNTG GTGCCCCTGG 2700 

CCCCGTTGGC CCTGCTGGCA AGAGTGGTGA TCGTGGTGAG ACTGGTCCTG CTGGTCCCGC 2760 

CGGTCCCGTC GGCCCCGCTG GCGCCCGTGG CCCCGCCGGA CCCCAAGGCC CCCGTGGTGA 2820 

CAAGGGTGAG ACAGGCGAAC AGGGCGACAG AGGCATAAAG GGTCACCGTG GCTTCTCTGG 2880 

CCTCCAGGGT CCCCCTGGCC CTCCTGGCTC TCCTGGTGAA CAAGGTCCCT CTGGAGCCTC 2940

TGGTCCTGCT GGTCCCCGAG GTCCCCCTGG CTCTGCTGGT GCTCCTGGCA AAGATGGACT 3000

CAACGGTCTC CCTGGCCCCA TTGGGCCCCC TGGTCCTCGC GGTCGCACTG GTGATGCTGG 3060 

TCCTGTTGGT CCCCCCGGCC CTCCTGGACC TCCTGGTCCC CCTGGTCCTC CCAGCGCTGG 3120 

TTTCGACTTC AGCTTCCTCC CCCAGCCACC TCAAGAGAAG GCTCACGATG GTGGCCGCTA 3180 

CTACCGGGCT AGATCTGATG AGGCTTCTGG GATAGGCCCA GAAGTTCCTG ATGACCGCGA 3240 

CTTCGAGCCC TCCCTAGGCC CAGTGTGCCC CTTCCGCTGT CAATGCCATC TTCGAGTGGT 3300 

CCAGTGTTCT GATTTGGGTC TGGACAAAGT GCCAAAGGAT CTTCCCCCTG ACACAACTCT 3360 

GCTAGACCTG CAAAACAACA AAATAACCGA AATCAAAGAT GGAGACTTTA AGAACCTGAA 3420 

GAACCTTCAC GCATTGATTC TTGTCAACAA TAAAATTAGC AAAGTTAGTC CTGGAGCATT 3480 

TACACCTTTG GTGAAGTTGG AACGACTTTA TCTGTCCAAG AATCAGCTGA AGGAATTGCC 3540 

AGAAAAAATG CCCAAAACTC TTCAGGAGCT GCGTGCCCAT GAGAATGAGA TCACCAAAGT 3600 

GCGAAAAGTT ACTTTCAATG GACTGAACCA GATGATTGTC ATAGAACTGG GCACCAATCC 3660 

GCTGAAGAGC TCAGGAATTG AAAATGGGGC TTTCCAGGGA ATGAAGAAGC TCTCCTACAT 3720 

CCGCATTGCT GATACCAATA TCACCAGCAT TCCTCAAGGT CTTCCTCCTT CCCTTACGGA 3780

ATTACATCTT GATGGCAACA AAATCAGCAG AGTTGATGCA GCTAGCCTGA AAGGACTGAA 3840 

TAATTTGGCT AAGTTGGGAT TGAGTTTCAA CAGCATCTCT GCTGTTGACA ATGGCTCTCT 3900 

GGCCAACACG CCTCATCTGA GGGAGCTTCA CTTGGACAAC AACAAGCTTA CCAGAGTACC 3960

TGGTGGGCTG GCAGAGCATA AGTACATCCA GGTTGTCTAC CTTCATAACA ACAATATCTC 4020

TGTAGTTGGA TCAAGTGACT TCTGCCCACC TGGACACAAC ACCAAAAAGG CTTCTTATTC 4080

GGGTGTGAGT CTTTTCAGCA ACCCGGTCCA GTACTGGGAG ATACAGCCAT CCACCTTCAG 4140

ATGTGTCTAC GTGCGCTCTG CCATTCAACT CGGAAACTAT AAGTAACTGC AG 4192 
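
The CDS locations and total lengths stated for SEQ ID NOS: 2 and 3 can be cross-checked with simple arithmetic: a coding region running from position 20 to position N spans N - 20 + 1 bases, which should be a whole number of codons. The sketch below is illustrative only and not part of the original listing; the function names `parse_location` and `codon_count` are assumptions introduced here.

```python
import re


def parse_location(text: str) -> tuple[int, int]:
    """Parse a 'start..end' feature location, ignoring stray whitespace."""
    match = re.fullmatch(r"(\d+)\.\.(\d+)", "".join(text.split()))
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return int(match.group(1)), int(match.group(2))


def codon_count(location: str) -> int:
    """Number of codons in a CDS given as an inclusive 'start..end' range."""
    start, end = parse_location(location)
    length = end - start + 1
    if length % 3:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


# (CDS location, total length in base pairs) as stated in the listing.
LISTINGS = {
    "SEQ ID NO: 2": ("20..3532", 3541),
    "SEQ ID NO: 3": ("20..4183", 4192),
}

for name, (location, total) in LISTINGS.items():
    _, end = parse_location(location)
    print(name, codon_count(location), "codons,", total - end, "bases after the CDS")
```

The codon counts obtained, 1171 and 1388, coincide with the lengths stated for the amino acid sequences SEQ ID NO: 6 and SEQ ID NO: 7 later in this listing.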
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3349 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 20..3340

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GGGAAGGATT TCCATTTCCC AGCTGTCTTA TGGCTATGAT GAGAAATCAA CCGGAGGAAT 60 

TTCCGTGCCT GGCCCCATGG GTCCCTCTGG TCCTCGTGGT CTCCCTGGCC CCCCTGGTGC 120 

ACCTGGTCCC CAAGGCTTCC AAGGTCCCCC TGGTGAGCCT GGCGAGCCTG GAGCTTCAGG 180 

TCCCATGGGT CCCCGAGGTC CCCCAGGTCC CCCTGGAAAG AATGGAGATG ATGGGGAAGC 240 

TGGAAAACCT GGTCGTCCTG GTGAGCGTGG GCCTCCTGGG CCTCAGGGTG CTCGAGGATT 300 

GCCCGGAACA GCTGGCCTCC CTGGAATGAA GGGACACAGA GGTTTCAGTG GTTTGGATGG 360 

TGCCAAGGGA GATGCTGGTC CTGCTGGTCC TAAGGGTGAG CCTGGCAGCC CTGGTGAAAA 420

TGGAGCTCCT GGTCAGATGG GCCCCCGTGG CCTGCCTGGT GAGAGAGGTC GCCCTGGAGC 480

CCCTGGCCCT GCTGGTGCTC GTGGAAATGA TGGTGCTACT GGTGCTGCCG GGCCCCCTGG 540

TCCCACCGGC CCCGCTGGTC CTCCTGGCTT CCCTGGTGCT GTTGGTGCTA AGGGTGAAGC 600

TGGTCCCCAA GGGCCCCGAG GCTCTGAAGG TCCCCAGGGT GTGCGTGGTG AGCCTGGCCC 660 

CCCTGGCCCT GCTGGTGCTG CTGGCCCTGC TGGAAACCCT GGTGCTGATG GACAGCCTGG 720 

TGCTAAAGGT GCCAATGGTG CTCCTGGTAT TGCTGGTGCT CCTGGCTTCC CTGGTGCCCG 780

AGGCCCCTCT GGACCCCAGG GCCCCGGCGG CCCTCCTGGT CCCAAGGGTA ACAGCGGTGA 840

ACCTGGTGCT CCTGGCAGCA AAGGAGACAC TGGTGCTAAG GGAGAGCCTG GCCCTGTTGG 900

TGTTCAAGGA CCCCCTGGCC CTGCTGGAGA GGAAGGAAAG CGAGGAGCTC GAGGTGAACC 960

CGGACCCACT GGCCTGCCCG GACCCCCTGG CGAGCGTGGT GGACCTGGTA GCCGTGGTTT 1020

CCCTGGCGCA GATGGTGTTG CTGGTCCCAA GGGTCCCGCT GGTGAACGTG GTTCTCCTGG 1080

CCCCGCTGGC CCCAAAGGAT CTCCTGGTGA AGCTGGTCGT CCCGGTGAAG CTGGTCTGCC 1140

TGGTGCCAAG GGTCTGACTG GAAGCCCTGG CAGCCCTGGT CCTGATGGCA AAACTGGCCC 1200

CCCTGGTCCC GCCGGTCAAG ATGGTCGCCC CGGACCCCCA GGCCCACCTG GTGCCCGTGG 1260

TCAGGCTGGT GTGATGGGAT TCCCTGGACC TAAAGGTGCT GCTGGAGAGC CCGGCAAGGC 1320

TGGAGAGCGA GGTGTTCCCG GACCCCCTGG CGCTGTCGGT CCTGCTGGCA AAGATGGAGA 1380

GGCTGGAGCT CAGGGACCCC CTGGCCCTGC TGGTCCCGCT GGCGAGAGAG GTGAACAAGG 1440

CCCTGCTGGC TCCCCCGGAT TCCAGGGTCT CCCTGGTCCT GCTGGTCCTC CAGGTGAAGC 1500 

AGGCAAACCT GGTGAACAGG GTGTTCCTGG AGACCTTGGC GCCCCTGGCC CCTCTGGAGC 1560 

AAGAGGCGAG AGAGGTTTCC CTGGCGAGCG TGGTGTGCAA GGTCCCCCTG GTCCTGCTGG 1620 

ACCCCGAGGG GCCAACGGTG CTCCCGGCAA CGATGGTGCT AAGGGTGATG CTGGTGCCCC 1680

TGGAGCTCCC GGTAGCCAGG GCGCCCCTGG CCTTCAGGGA ATGCCTGGTG AACGTGGTGC 1740

AGCTGGTCTT CCAGGGCCTA AGGGTGACAG AGGTGATGCT GGTCCCAAAG GTGCTGATGG 1800

CTCTCCTGGC AAAGATGGCG TCCGTGGTCT GACCGGCCCC ATTGGTCCTC CTGGCCCTGC 1860

TGGTGCCCCT GGTGACAAGG GTGAAAGTGG TCCCAGCGGC CCTGCTGGTC CCACTGGAGC 1920 

TCGTGGTGCC CCCGGAGACC GTGGTGAGCC TGGTCCCCCC GGCCCTGCTG GCTTTGCTGG 1980 

CCCCCCTGGT GCTGACGGCC AACCTGGTGC TAAAGGCGAA CCTGGTGATG CTGGTGCCAA 2040 

AGGCGATGCT GGTCCCCCTG GGCCTGCCGG ACCCGCTGGA CCCCCTGGCC CCATTGGTAA 2100 

TGTTGGTGCT CCTGGAGCCA AAGGTGCTCG CGGCAGCGCT GGTCCCCCTG GTGCTACTGG 2160 

TTTCCCTGGT GCTGCTGGCC GAGTCGGTCC TCCTGGCCCC TCTGGAAATG CTGGACCCCC 2220

TGGCCCTCCT GGTCCTGCTG GCAAAGAAGG CGGCAAAGGT CCCCGTGGTG AGACTGGCCC 2280 

TGCTGGACGT CCTGGTGAAG TTGGTCCCCC TGGTCCCCCT GGCCCTGCTG GCGAGAAAGG 2340 

ATCCCCTGGT GCTGATGGTC CTGCTGGTGC TCCTGGTACT CCCGGGCCTC AAGGTATTGC 2400 

TGGACAGCGT GGTGTGGTCG GCCTGCCTGG TCAGAGAGGA GAGAGAGGCT TCCGTGGTCT 2460

TCCTGGCCCC TCTGGTGAAC CTGGCAAACA AGGTCCCTCT GGAGCAAGTG GTGAACGTGG 2520

TCCCCCCGGT CCCATGGGCC CCCCTGGATT GGCTGGACCC CCTGGTGAAT CTGGACGTGA 2580

GGGGGCTCCT GCTGCCGAAG GTTCCCCTGG ACGAGACGGT TCTCCTGGCG CCAAGGGTGA 2640 

CCGTGGTGAG ACCGGCCCCG CTGGACCCCC TGGTGCTCNT GGTGCTCNTG GTGCCCCTGG 2700 

CCCCGTTGGC CCTGCTGGCA AGAGTGGTGA TCGTGGTGAG ACTGGTCCTG CTGGTCCCGC 2760 

CGGTCCCGTC GGCCCCGCTG GCGCCCGTGG CCCCGCCGGA CCCCAAGGCC CCCGTGGTGA 2820 

CAAGGGTGAG ACAGGCGAAC AGGGCGACAG AGGCATAAAG GGTCACCGTG GCTTCTCTGG 2880 

CCTCCAGGGT CCCCCTGGCC CTCCTGGCTC TCCTGGTGAA CAAGGTCCCT CTGGAGCCTC 2940

TGGTCCTGCT GGTCCCCGAG GTCCCCCTGG CTCTGCTGGT GCTCCTGGCA AAGATGGACT 3000 

CAACGGTCTC CCTGGCCCCA TTGGGCCCCC TGGTCCTCGC GGTCGCACTG GTGATGCTGG 3060 

TCCTGTTGGT CCCCCCGGCC CTCCTGGACC TCCTGGTCCC CCTGGTCCTC CCAGCGCTGG 3120 

TTTCGACTTC AGCTTCCTCC CCCAGCCACC TCAAGAGAAG GCTCACGATG GTGGCCGCTA 3180

CTACCGGGCT AGATCTCCAA AGGATCTTCC CCCTGACACA ACTCTGCTAG ACCTGCAAAA 3240

CAACAAAATA ACCGAAATCA AAGATGGAGA CTTTAAGAAC CTGAAGAACC TTCACGCATT 3300

GATTCTTGTC AACAATAAAA TTAGCAAAGT TAGTCCTGGA TAACTGCAG 3349

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1169 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Gln Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly Ile Ser Val
1 5 10 15

Pro Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro
20 25 30

Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Pro Gly Glu Pro Gly
35 40 45

Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro Pro Gly Pro
50 55 60

Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro Gly Arg Pro
65 70 75 80

Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly Ala Arg Gly Leu Pro Gly
85 90 95

Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg Gly Phe Ser Gly Leu
100 105 110

Asp Gly Ala Lys Gly Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu Pro
115 120 125

Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly Gln Met Gly Pro Arg Gly
130 135 140

Leu Pro Gly Glu Arg Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly Ala
145 150 155 160

Arg Gly Asn Asp Gly Ala Thr Gly Ala Ala Gly Pro Pro Gly Pro Thr
165 170 175

Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Val Gly Ala Lys Gly
180 185 190

Glu Ala Gly Pro Gln Gly Pro Arg Gly Ser Glu Gly Pro Gln Gly Val
195 200 205

Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro Ala
210 215 220

Gly Asn Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Ala Asn Gly
225 230 235 240

Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly Pro
245 250 255

Ser Gly Pro Gln Gly Pro Gly Gly Pro Pro Gly Pro Lys Gly Asn Ser
260 265 270

Gly Glu Pro Gly Ala Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys Gly
275 280 285

Glu Pro Gly Pro Val Gly Val Gln Gly Pro Pro Gly Pro Ala Gly Glu
290 295 300

Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro Gly Pro Thr Gly Leu Pro
305 310 315 320

Gly Pro Pro Gly Glu Arg Gly Gly Pro Gly Ser Arg Gly Phe Pro Gly
325 330 335

Ala Asp Gly Val Ala Gly Pro Lys Gly Pro Ala Gly Glu Arg Gly Ser
340 345 350

Pro Gly Pro Ala Gly Pro Lys Gly Ser Pro Gly Glu Ala Gly Arg Pro
355 360 365

Gly Glu Ala Gly Leu Pro Gly Ala Lys Gly Leu Thr Gly Ser Pro Gly
370 375 380

Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro Pro Gly Pro Ala Gly Gln
385 390 395 400

Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro Gly Ala Arg Gly Gln Ala
405 410 415

Gly Val Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly
420 425 430

Lys Ala Gly Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro
435 440 445

Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro Ala
450 455 460

Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro Gly
465 470 475 480

Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys
485 490 495

Pro Gly Glu Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser
500 505 510

Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly Val Gln Gly
515 520 525

Pro Pro Gly Pro Ala Gly Pro Arg Gly Ala Asn Gly Ala Pro Gly Asn
530 535 540

Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Ser Gln
545 550 555 560

Gly Ala Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Ala Ala Gly
565 570 575

Leu Pro Gly Pro Lys Gly Asp Arg Gly Asp Ala Gly Pro Lys Gly Ala
580 585 590

Asp Gly Ser Pro Gly Lys Asp Gly Val Arg Gly Leu Thr Gly Pro Ile
595 600 605

Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Asp Lys Gly Glu Ser Gly
610 615 620

Pro Ser Gly Pro Ala Gly Pro Thr Gly Ala Arg Gly Ala Pro Gly Asp
625 630 635 640

Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Phe Ala Gly Pro Pro
645 650 655

Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Glu Pro Gly Asp Ala Gly
660 665 670

Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Pro
675 680 685

Pro Gly Pro Ile Gly Asn Val Gly Ala Pro Gly Ala Lys Gly Ala Arg
690 695 700

Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly Ala Ala Gly
705 710 715 720

Arg Val Gly Pro Pro Gly Pro Ser Gly Asn Ala Gly Pro Pro Gly Pro
725 730 735

Pro Gly Pro Ala Gly Lys Glu Gly Gly Lys Gly Pro Arg Gly Glu Thr
740 745 750

Gly Pro Ala Gly Arg Pro Gly Glu Val Gly Pro Pro Gly Pro Pro Gly
755 760 765

Pro Ala Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala Gly Ala
770 775 780

Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly Val Val
785 790 795 800

Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly
805 810 815

Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser Gly Glu
820 825 830

Arg Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Leu Ala Gly Pro Pro
835 840 845

Gly Glu Ser Gly Arg Glu Gly Ala Pro Ala Ala Glu Gly Ser Pro Gly
850 855 860

Arg Asp Gly Ser Pro Gly Ala Lys Gly Asp Arg Gly Glu Thr Gly Pro
865 870 875 880

Ala Gly Pro Pro Gly Ala Xaa Gly Ala Xaa Gly Ala Pro Gly Pro Val
885 890 895

Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly
900 905 910

Pro Ala Gly Pro Val Gly Pro Ala Gly Ala Arg Gly Pro Ala Gly Pro
915 920 925

Gln Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp Arg
930 935 940

Gly Ile Lys Gly His Arg Gly Phe Ser Gly Leu Gln Gly Pro Pro Gly
945 950 955 960

Pro Pro Gly Ser Pro Gly Glu Gln Gly Pro Ser Gly Ala Ser Gly Pro
965 970 975

Ala Gly Pro Arg Gly Pro Pro Gly Ser Ala Gly Ala Pro Gly Lys Asp
980 985 990

Gly Leu Asn Gly Leu Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly
995 1000 1005

Arg Thr Gly Asp Ala Gly Pro Val Gly Pro Pro Gly Pro Pro Gly Pro
1010 1015 1020

Pro Gly Pro Pro Gly Pro Pro Ser Ala Gly Phe Asp Phe Ser Phe Leu
1025 1030 1035 1040

Pro Gln Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg Tyr Tyr Arg
1045 1050 1055

Ala Arg Ser Gln Arg Ala Arg Lys Lys Asn Lys Asn Cys Arg Arg His
1060 1065 1070

Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val
1075 1080 1085

Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly Asp Cys Pro Phe
1090 1095 1100

Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile Val Gln Thr
1105 1110 1115 1120

Leu Val Asn Ser Val Asn Ser Ser Ile Pro Lys Ala Cys Cys Val Pro
1125 1130 1135

Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu Tyr Asp Lys
1140 1145 1150

Val Val Leu Lys Asn Tyr Gln Glu Met Val Val Glu Gly Cys Gly Cys
1155 1160 1165

Arg 
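
For length checks or comparison with a translated nucleotide sequence, a three-letter listing such as the one above can be collapsed to one-letter codes. The following is a minimal sketch and not part of the original listing; the function name `to_one_letter` and the decision to skip purely numeric tokens (the position rulers printed under each row) are illustrative assumptions.

```python
# Standard three-letter to one-letter amino acid codes; Xaa marks an
# unspecified residue and is mapped to X.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(listing: str) -> str:
    """Convert a three-letter listing to a one-letter string.

    Numeric tokens are treated as position rulers and skipped. An
    unrecognised code raises KeyError, which flags a transcription error.
    """
    residues = []
    for token in listing.split():
        if token.isdigit():
            continue
        residues.append(THREE_TO_ONE[token])
    return "".join(residues)


# First two rows of the listing above (residues 1 to 32).
SAMPLE = """
Gln Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly Ile Ser Val
1 5 10 15
Pro Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro
20 25 30
"""

one_letter = to_one_letter(SAMPLE)
print(one_letter, len(one_letter))
```

Applied to a complete listing, the length of the returned string can be compared with the LENGTH entry stated under SEQUENCE CHARACTERISTICS.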



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1171 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Gln Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly Ile Ser Val
1 5 10 15

Pro Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro
20 25 30

Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly Pro Pro Gly Glu Pro Gly
35 40 45

Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro Pro Gly Pro
50 55 60

Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro Gly Arg Pro
65 70 75 80

Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly Ala Arg Gly Leu Pro Gly
85 90 95

Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg Gly Phe Ser Gly Leu
100 105 110

Asp Gly Ala Lys Gly Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu Pro
115 120 125

Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly Gln Met Gly Pro Arg Gly
130 135 140

Leu Pro Gly Glu Arg Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly Ala
145 150 155 160

Arg Gly Asn Asp Gly Ala Thr Gly Ala Ala Gly Pro Pro Gly Pro Thr
165 170 175

Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Val Gly Ala Lys Gly
180 185 190

Glu Ala Gly Pro Gln Gly Pro Arg Gly Ser Glu Gly Pro Gln Gly Val
195 200 205

Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro Ala
210 215 220

Gly Asn Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Ala Asn Gly
225 230 235 240

Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly Pro
245 250 255

Ser Gly Pro Gln Gly Pro Gly Gly Pro Pro Gly Pro Lys Gly Asn Ser
260 265 270

Gly Glu Pro Gly Ala Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys Gly
275 280 285

Glu Pro Gly Pro Val Gly Val Gln Gly Pro Pro Gly Pro Ala Gly Glu
290 295 300

Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro Gly Pro Thr Gly Leu Pro
305 310 315 320

Gly Pro Pro Gly Glu Arg Gly Gly Pro Gly Ser Arg Gly Phe Pro Gly
325 330 335

Ala Asp Gly Val Ala Gly Pro Lys Gly Pro Ala Gly Glu Arg Gly Ser
340 345 350

Pro Gly Pro Ala Gly Pro Lys Gly Ser Pro Gly Glu Ala Gly Arg Pro
355 360 365

Gly Glu Ala Gly Leu Pro Gly Ala Lys Gly Leu Thr Gly Ser Pro Gly
370 375 380

Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro Pro Gly Pro Ala Gly Gln
385 390 395 400

Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro Gly Ala Arg Gly Gln Ala
405 410 415

Gly Val Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly
420 425 430

Lys Ala Gly Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro
435 440 445

Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln Gly Pro Pro Gly Pro Ala
450 455 460

Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly Pro Ala Gly Ser Pro Gly
465 470 475 480

Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys
485 490 495

Pro Gly Glu Gln Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser
500 505 510

Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly Val Gln Gly
515 520 525

Pro Pro Gly Pro Ala Gly Pro Arg Gly Ala Asn Gly Ala Pro Gly Asn
530 535 540

Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Ser Gln
545 550 555 560

Gly Ala Pro Gly Leu Gln Gly Met Pro Gly Glu Arg Gly Ala Ala Gly
565 570 575

Leu Pro Gly Pro Lys Gly Asp Arg Gly Asp Ala Gly Pro Lys Gly Ala
580 585 590

Asp Gly Ser Pro Gly Lys Asp Gly Val Arg Gly Leu Thr Gly Pro Ile
595 600 605

Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Asp Lys Gly Glu Ser Gly
610 615 620

Pro Ser Gly Pro Ala Gly Pro Thr Gly Ala Arg Gly Ala Pro Gly Asp
625 630 635 640

Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Phe Ala Gly Pro Pro
645 650 655

Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Glu Pro Gly Asp Ala Gly
660 665 670

Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Pro
675 680 685

Pro Gly Pro Ile Gly Asn Val Gly Ala Pro Gly Ala Lys Gly Ala Arg
690 695 700

Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly Ala Ala Gly
705 710 715 720

Arg Val Gly Pro Pro Gly Pro Ser Gly Asn Ala Gly Pro Pro Gly Pro
725 730 735

Pro Gly Pro Ala Gly Lys Glu Gly Gly Lys Gly Pro Arg Gly Glu Thr
740 745 750

Gly Pro Ala Gly Arg Pro Gly Glu Val Gly Pro Pro Gly Pro Pro Gly
755 760 765

Pro Ala Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala Gly Ala
770 775 780

Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly Val Val
785 790 795 800

Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly
805 810 815

Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser Gly Glu
820 825 830

Arg Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Leu Ala Gly Pro Pro
835 840 845

Gly Glu Ser Gly Arg Glu Gly Ala Pro Ala Ala Glu Gly Ser Pro Gly
850 855 860

Arg Asp Gly Ser Pro Gly Ala Lys Gly Asp Arg Gly Glu Thr Gly Pro
865 870 875 880

Ala Gly Pro Pro Gly Ala Xaa Gly Ala Xaa Gly Ala Pro Gly Pro Val
885 890 895

Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly
900 905 910

Pro Ala Gly Pro Val Gly Pro Ala Gly Ala Arg Gly Pro Ala Gly Pro
915 920 925

Gln Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp Arg
930 935 940

Gly Ile Lys Gly His Arg Gly Phe Ser Gly Leu Gln Gly Pro Pro Gly
945 950 955 960

Pro Pro Gly Ser Pro Gly Glu Gln Gly Pro Ser Gly Ala Ser Gly Pro
965 970 975

Ala Gly Pro Arg Gly Pro Pro Gly Ser Ala Gly Ala Pro Gly Lys Asp
980 985 990

Gly Leu Asn Gly Leu Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly
995 1000 1005

Arg Thr Gly Asp Ala Gly Pro Val Gly Pro Pro Gly Pro Pro Gly Pro
1010 1015 1020

Pro Gly Pro Pro Gly Pro Pro Ser Ala Gly Phe Asp Phe Ser Phe Leu
1025 1030 1035 1040

Pro Gln Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg Tyr Tyr Arg
1045 1050 1055

Ala Arg Ser Ala Leu Asp Thr Asn Tyr Cys Phe Ser Ser Thr Glu Lys
1060 1065 1070

Asn Cys Cys Val Arg Gln Leu Tyr Ile Asp Phe Arg Lys Asp Leu Gly
1075 1080 1085

Trp Lys Trp Ile His Glu Pro Lys Gly Tyr His Ala Asn Phe Cys Leu
1090 1095 1100

Gly Pro Cys Pro Tyr Ile Trp Ser Leu Asp Thr Gln Tyr Ser Lys Val
1105 1110 1115 1120

Leu Ala Leu Tyr Asn Gln His Asn Pro Gly Ala Ser Ala Ala Pro Cys
1125 1130 1135

Cys Val Pro Gln Ala Leu Glu Pro Leu Pro Ile Val Tyr Tyr Val Gly
1140 1145 1150

Arg Lys Pro Lys Val Glu Gln Leu Ser Asn Met Ile Val Arg Ser Cys
1155 1160 1165

Lys Cys Ser
1170



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1388 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gin Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly He Ser Val 
15 10 15 

Pro Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro 
20 25 30 

Gly Ala Pro Gly Pro Gin Gly Phe Gin Gly Pro Pro Gly Glu Pro Gly 
35 40 45 

Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro Pro Gly Pro 
50 55 60 

Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro Gly Arg Pro 
65 70 75 80 

Gly Glu Arg Gly Pro Pro Gly Pro Gin Gly Ala Arg Gly Leu Pro Gly 
85 90 95 

Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg Gly Phe Ser Gly Leu 
100 105 110 

Asp Gly Ala Lys Gly Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu Pro 
115 120 125 

Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly Gin Met Gly Pro Arg Gly 
130 135 140 

Leu Pro Gly Glu Arg Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly Ala 
145 150 155 160 

Arg Gly Asn Asp Gly Ala Thr Gly Ala Ala Gly Pro Pro Gly Pro Thr 
165 170 175 

Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Val Gly Ala Lys Gly 
180 185 190 

Glu Ala Gly Pro Gin Gly Pro Arg Gly Ser Glu Gly Pro Gin Gly Val 
195 200 205 

Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro Ala 
210 215 220 

Gly Asn Pro Gly Ala Asp Gly Gin Pro Gly Ala Lys Gly Ala Asn Gly 
225 230 235 240 

Ala Pro Gly He Ala Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly Pro 
245 250 255 

Ser Gly Pro Gln Gly Pro Gly Gly Pro Pro Gly Pro Lys Gly Asn Ser 
260 265 270 

Gly Glu Pro Gly Ala Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys Gly 
275 280 285 

Glu Pro Gly Pro Val Gly Val Gin Gly Pro Pro Gly Pro Ala Gly Glu 
290 295 300 

Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro Gly Pro Thr Gly Leu Pro 
305 310 315 320 

Gly Pro Pro Gly Glu Arg Gly Gly Pro Gly Ser Arg Gly Phe Pro Gly 
325 330 335 

Ala Asp Gly Val Ala Gly Pro Lys Gly Pro Ala Gly Glu Arg Gly Ser 
340 345 350 

Pro Gly Pro Ala Gly Pro Lys Gly Ser Pro Gly Glu Ala Gly Arg Pro 
355 360 365 

Gly Glu Ala Gly Leu Pro Gly Ala Lys Gly Leu Thr Gly Ser Pro Gly 
370 375 380 

Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro Pro Gly Pro Ala Gly Gin 
385 390 395 400 

Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro Gly Ala Arg Gly Gin Ala 
405 410 415 

Gly Val Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly 
420 425 430 

Lys Ala Gly Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro 
435 440 445 

Ala Gly Lys Asp Gly Glu Ala Gly Ala Gin Gly Pro Pro Gly Pro Ala 
450 455 460 

Gly Pro Ala Gly Glu Arg Gly Glu Gin Gly Pro Ala Gly Ser Pro Gly 
465 470 475 480 

Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys 
485 490 495 

Pro Gly Glu Gin Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser 
500 505 510 

Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly Val Gin Gly 
515 520 525 

Pro Pro Gly Pro Ala Gly Pro Arg Gly Ala Asn Gly Ala Pro Gly Asn 
530 535 540 

Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Ser Gin 
545 550 555 560 

Gly Ala Pro Gly Leu Gin Gly Met Pro Gly Glu Arg Gly Ala Ala Gly 
565 570 575 

Leu Pro Gly Pro Lys Gly Asp Arg Gly Asp Ala Gly Pro Lys Gly Ala 
580 585 590 

Asp Gly Ser Pro Gly Lys Asp Gly Val Arg Gly Leu Thr Gly Pro Ile 
595 600 605 

Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Asp Lys Gly Glu Ser Gly 
610 615 620 

Pro Ser Gly Pro Ala Gly Pro Thr Gly Ala Arg Gly Ala Pro Gly Asp 
625 630 635 640 

Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Phe Ala Gly Pro Pro 
645 650 655 



Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly Glu Pro Gly Asp Ala Gly 
660 665 670 

Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Pro 
675 680 685 

Pro Gly Pro Ile Gly Asn Val Gly Ala Pro Gly Ala Lys Gly Ala Arg 
690 695 700 

Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly Ala Ala Gly 
705 710 715 720 

Arg Val Gly Pro Pro Gly Pro Ser Gly Asn Ala Gly Pro Pro Gly Pro 
725 730 735 

Pro Gly Pro Ala Gly Lys Glu Gly Gly Lys Gly Pro Arg Gly Glu Thr 
740 745 750 

Gly Pro Ala Gly Arg Pro Gly Glu Val Gly Pro Pro Gly Pro Pro Gly 
755 760 765 

Pro Ala Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala Gly Ala 
770 775 780 

Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala Gly Gln Arg Gly Val Val 
785 790 795 800 

Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly 
805 810 815 

Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro Ser Gly Ala Ser Gly Glu 
820 825 830 

Arg Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Leu Ala Gly Pro Pro 
835 840 845 

Gly Glu Ser Gly Arg Glu Gly Ala Pro Ala Ala Glu Gly Ser Pro Gly 
850 855 860 

Arg Asp Gly Ser Pro Gly Ala Lys Gly Asp Arg Gly Glu Thr Gly Pro 
865 870 875 880 

Ala Gly Pro Pro Gly Ala Xaa Gly Ala Xaa Gly Ala Pro Gly Pro Val 
885 890 895 

Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly 
900 905 910 

Pro Ala Gly Pro Val Gly Pro Ala Gly Ala Arg Gly Pro Ala Gly Pro 
915 920 925 

Gln Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp Arg 
930 935 940 

Gly Ile Lys Gly His Arg Gly Phe Ser Gly Leu Gln Gly Pro Pro Gly 
945 950 955 960 

Pro Pro Gly Ser Pro Gly Glu Gln Gly Pro Ser Gly Ala Ser Gly Pro 
965 970 975 

Ala Gly Pro Arg Gly Pro Pro Gly Ser Ala Gly Ala Pro Gly Lys Asp 
980 985 990 

Gly Leu Asn Gly Leu Pro Gly Pro lie Gly Pro Pro Gly Pro Arg Gly 
995 1000 1005 

Arg Thr Gly Asp Ala Gly Pro Val Gly Pro Pro Gly Pro Pro Gly Pro 
1010 1015 1020 

Pro Gly Pro Pro Gly Pro Pro Ser Ala Gly Phe Asp Phe Ser Phe Leu 
1025 1030 1035 1040 

Pro Gin Pro Pro Gin Glu Lys Ala His Asp Arg Gly Arg Tyr Tyr Arg 
1045 1050 1055 

Ala Arg Ser Asp Glu Ala Ser Gly lie Gly Pro Glu Val Pro Asp Asp 
1060 1065 1070 

Arg Asp Phe Glu Pro Ser Leu Gly Pro Val Cys Pro Phe Arg Cys Gin 
1075 1080 1085 

Cys His Leu Arg Val Val Gin Cys Ser Asp Leu Gly Leu Asp Lys Val 
1090 1095 1100 

Pro Lys Asp Leu Pro Pro Asp Thr Thr Leu Leu Asp Leu Gin Asn Asn 
1105 1110 1115 1120 

Lys lie Thr Glu lie Lys Asp Gly Asp Phe Lys Asn Leu Lys Asn Leu 
1125 1130 1135 

His Ala Leu lie Leu Val Asn Asn Lys lie Ser Lys Val Ser Pro Gly 
1140 1145 1150 

Ala Phe Thr Pro Leu Val Lys Leu Glu Arg Leu Tyr Leu Ser Lys Asn 
1155 1160 1165 

Gin Leu Lys Glu Leu Pro Glu Lys Met Pro Lys Thr Leu Gin Glu Leu 
1170 1175 1180 

Arg Ala His Glu Asn Glu lie Thr Lys Val Arg Lys Val Thr Phe Asn 
1185 1190 1195 1200 

Gly Leu Asn Gin Met lie Val lie Glu Leu Gly Thr Asn Pro Leu Lys 
1205 1210 1215 

Ser Ser Gly lie Glu Asn Gly Ala Phe Gin Gly Met Lys Lys Leu Ser 
1220 1225 1230 

Tyr lie Arg lie Ala Asp Thr Asn lie Thr Ser lie Pro Gin Gly Leu 
1235 1240 1245 

Pro Pro Ser Leu Thr Glu Leu His Leu Asp Gly Asn Lys lie Ser Arg 
1250 1255 1260 

Val Asp Ala Ala Ser Leu Lys Gly Leu Asn Asn Leu Ala Lys Leu Gly 
1265 1270 1275 1280 

Leu Ser Phe Asn Ser lie Ser Ala Val Asp Asn Gly Ser Leu Ala Asn 
1285 1290 1295 

Thr Pro His Leu Arg Glu Leu His Leu Asp Asn Asn Lys Leu Thr Arg 
1300 1305 1310 

Val Pro Gly Gly Leu Ala Glu His Lys Tyr Ile Gln Val Val Tyr Leu 
1315 1320 1325 

His Asn Asn Asn lie Ser Val Val Gly Ser Ser Asp Phe Cys Pro Pro 
1330 1335 1340 

Gly His Asn Thr Lys Lys Ala Ser Tyr Ser Gly Val Ser Leu Phe Ser 
1345 1350 1355 1360 

Asn Pro Val Gin Tyr Trp Glu lie Gin Pro Ser Thr Phe Arg Cys Val 
1365 1370 1375 

Tyr Val Arg Ser Ala lie Gin Leu Gly Asn Tyr Lys 
1380 1385 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gin Leu Ser Tyr Gly Tyr Asp Glu Lys Ser Thr Gly Gly lie Ser Val 
1 5 10 15 

Pro Gly Pro Met Gly Pro Ser Gly Pro Arg Gly Leu Pro Gly Pro Pro 
20 25 30 

Gly Ala Pro Gly Pro Gin Gly Phe Gin Gly Pro Pro Gly Glu Pro Gly 
35 40 45 

Glu Pro Gly Ala Ser Gly Pro Met Gly Pro Arg Gly Pro Pro Gly Pro 
50 55 60 

Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala Gly Lys Pro Gly Arg Pro 
65 70 75 80 

Gly Glu Arg Gly Pro Pro Gly Pro Gin Gly Ala Arg Gly Leu Pro Gly 
85 90 95 

Thr Ala Gly Leu Pro Gly Met Lys Gly His Arg Gly Phe Ser Gly Leu 
100 105 110 

Asp Gly Ala Lys Gly Asp Ala Gly Pro Ala Gly Pro Lys Gly Glu Pro 
115 120 125 

Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly Gin Met Gly Pro Arg Gly 
130 135 140 

Leu Pro Gly Glu Arg Gly Arg Pro Gly Ala Pro Gly Pro Ala Gly Ala 
145 150 155 160 

Arg Gly Asn Asp Gly Ala Thr Gly Ala Ala Gly Pro Pro Gly Pro Thr 
165 170 175 

Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly Ala Val Gly Ala Lys Gly 
180 185 190 

Glu Ala Gly Pro Gin Gly Pro Arg Gly Ser Glu Gly Pro Gin Gly Val 
195 200 205 

Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Ala Ala Gly Pro Ala 
210 215 220 

Gly Asn Pro Gly Ala Asp Gly Gin Pro Gly Ala Lys Gly Ala Asn Gly 
225 230 235 240 

Ala Pro Gly He Ala Gly Ala Pro Gly Phe Pro Gly Ala Arg Gly Pro 
245 250 255 

Ser Gly Pro Gin Gly Pro Gly Gly Pro Pro Gly Pro Lys Gly Asn Ser 
260 265 270 

Gly Glu Pro Gly Ala Pro Gly Ser Lys Gly Asp Thr Gly Ala Lys Gly 
275 280 285 

Glu Pro Gly Pro Val Gly Val Gin Gly Pro Pro Gly Pro Ala Gly Glu 
290 295 300 

Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro Gly Pro Thr Gly Leu Pro 
305 310 315 320 

Gly Pro Pro Gly Glu Arg Gly Gly Pro Gly Ser Arg Gly Phe Pro Gly 
325 330 335 

Ala Asp Gly Val Ala Gly Pro Lys Gly Pro Ala Gly Glu Arg Gly Ser 
340 345 350 

Pro Gly Pro Ala Gly Pro Lys Gly Ser Pro Gly Glu Ala Gly Arg Pro 
355 360 365 

Gly Glu Ala Gly Leu Pro Gly Ala Lys Gly Leu Thr Gly Ser Pro Gly 
370 375 380 

Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro Pro Gly Pro Ala Gly Gin 
385 390 395 400 

Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro Gly Ala Arg Gly Gin Ala 
405 410 415 

Gly Val Met Gly Phe Pro Gly Pro Lys Gly Ala Ala Gly Glu Pro Gly 
420 425 430 

Lys Ala Gly Glu Arg Gly Val Pro Gly Pro Pro Gly Ala Val Gly Pro 
435 440 445 

Ala Gly Lys Asp Gly Glu Ala Gly Ala Gin Gly Pro Pro Gly Pro Ala 
450 455 460 

Gly Pro Ala Gly Glu Arg Gly Glu Gin Gly Pro Ala Gly Ser Pro Gly 
465 470 475 480 

Phe Gin Gly Leu Pro Gly Pro Ala Gly Pro Pro Gly Glu Ala Gly Lys 
485 490 495 

Pro Gly Glu Gin Gly Val Pro Gly Asp Leu Gly Ala Pro Gly Pro Ser 
500 505 510 

Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly Glu Arg Gly Val Gin Gly 
515 520 525 

Pro Pro Gly Pro Ala Gly Pro Arg Gly Ala Asn Gly Ala Pro Gly Asn 
530 535 540 

Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro Gly Ala Pro Gly Ser Gin 
545 550 555 560 

Gly Ala Pro Gly Leu Gin Gly Met Pro Gly Glu Arg Gly Ala Ala Gly 
565 570 575 

Leu Pro Gly Pro Lys Gly Asp Arg Gly Asp Ala Gly Pro Lys Gly Ala 
580 585 590 

Asp Gly Ser Pro Gly Lys Asp Gly Val Arg Gly Leu Thr Gly Pro lie 
595 600 605 

Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly Asp Lys Gly Glu Ser Gly 
610 615 620 

Pro Ser Gly Pro Ala Gly Pro Thr Gly Ala Arg Gly Ala Pro Gly Asp 
625 630 635 640 

Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala Gly Phe Ala Gly Pro Pro 
645 650 655 

Gly Ala Asp Gly Gin Pro Gly Ala Lys Gly Glu Pro Gly Asp Ala Gly 
660 665 670 

Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro Ala Gly Pro Ala Gly Pro 
675 680 685 

Pro Gly Pro lie Gly Asn Val Gly Ala Pro Gly Ala Lys Gly Ala Arg 
690 695 700 

Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly Phe Pro Gly Ala Ala Gly 
705 710 715 720 

Arg Val Gly Pro Pro Gly Pro Ser Gly Asn Ala Gly Pro Pro Gly Pro 
725 730 735 

Pro Gly Pro Ala Gly Lys Glu Gly Gly Lys Gly Pro Arg Gly Glu Thr 
740 745 750 

Gly Pro Ala Gly Arg Pro Gly Glu Val Gly Pro Pro Gly Pro Pro Gly 
755 760 765 

Pro Ala Gly Glu Lys Gly Ser Pro Gly Ala Asp Gly Pro Ala Gly Ala 
770 775 780 

Pro Gly Thr Pro Gly Pro Gin Gly lie Ala Gly Gin Arg Gly Val Val 
785 790 795 800 

Gly Leu Pro Gly Gin Arg Gly Glu Arg Gly Phe Pro Gly Leu Pro Gly 
805 810 815 

Pro Ser Gly Glu Pro Gly Lys Gin Gly Pro Ser Gly Ala Ser Gly Glu 
820 825 830 

Arg Gly Pro Pro Gly Pro Met Gly Pro Pro Gly Leu Ala Gly Pro Pro 
835 840 845 

Gly Glu Ser Gly Arg Glu Gly Ala Pro Ala Ala Glu Gly Ser Pro Gly 
850 855 860 

Arg Asp Gly Ser Pro Gly Ala Lys Gly Asp Arg Gly Glu Thr Gly Pro 
865 870 875 880 

Ala Gly Pro Pro Gly Ala Xaa Gly Ala Xaa Gly Ala Pro Gly Pro Val 
885 890 895 

Gly Pro Ala Gly Lys Ser Gly Asp Arg Gly Glu Thr Gly Pro Ala Gly 
900 905 910 

Pro Ala Gly Pro Val Gly Pro Ala Gly Ala Arg Gly Pro Ala Gly Pro 
915 920 925 

Gin Gly Pro Arg Gly Asp Lys Gly Glu Thr Gly Glu Gin Gly Asp Arg 
930 935 940 

Gly lie Lys Gly His Arg Gly Phe Ser Gly Leu Gin Gly Pro Pro Gly 
945 950 955 960 

Pro Pro Gly Ser Pro Gly Glu Gin Gly Pro Ser Gly Ala Ser Gly Pro 
965 970 975 

Ala Gly Pro Arg Gly Pro Pro Gly Ser Ala Gly Ala Pro Gly Lys Asp 
980 985 990 

Gly Leu Asn Gly Leu Pro Gly Pro lie Gly Pro Pro Gly Pro Arg Gly 
995 1000 1005 

Arg Thr Gly Asp Ala Gly Pro Val Gly Pro Pro Gly Pro Pro Gly Pro 
1010 1015 1020 

Pro Gly Pro Pro Gly Pro Pro Ser Ala Gly Phe Asp Phe Ser Phe Leu 
1025 1030 1035 1040 

Pro Gln Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg Tyr Tyr Arg 
1045 1050 1055 

Ala Arg Ser Pro Lys Asp Leu Pro Pro Asp Thr Thr Leu Leu Asp Leu 
1060 1065 1070 

Gln Asn Asn Lys Ile Thr Glu Ile Lys Asp Gly Asp Phe Lys Asn Leu 
1075 1080 1085 

Lys Asn Leu His Ala Leu Ile Leu Val Asn Asn Lys Ile Ser Lys Val 
1090 1095 1100 

Claims 

1. A chimeric DNA construct comprising a domain derived from a DNA sequence encoding a cellular regulatory factor 
and a domain derived from a DNA sequence encoding an extracellular matrix protein. 

2. A chimeric DNA construct according to claim 1, wherein said extracellular matrix protein is selected from the group 
consisting of collagen, laminin, fibronectin, elastin and fibrin. 

3. A chimeric DNA construct according to claim 1 or 2, wherein said cellular regulatory factor is selected from the group 
consisting of BMP, TGF-β, and decorin. 

4. A chimeric DNA construct according to claim 1 or 2, wherein said cellular regulatory factor is selected from the group 
consisting of a BMP fragment, a TGF-β fragment and a decorin peptide. 

5. The DNA construct according to claim 3, wherein said BMP protein comprises BMP-2B. 

6. A cloning vector comprising a DNA construct according to any one of claims 1 to 5. 

7. A cloning vector according to claim 6, wherein said cloning vector is selected from the group consisting of plasmids, 
phages, cosmids and artificial chromosomes. 

8. A cloning vector according to claim 6 or 7, wherein said cloning vector is pMal. 

9. A cell transformed by a cloning vector according to any one of claims 6 to 8. 

10. A cell according to claim 9, wherein said cell is selected from the group consisting of E. coli, HeLa, 3T3, CHO, SP2, 
Sf9, Sf21, and High Five. 

11. A chimeric protein comprising a domain derived from a cellular regulatory factor and a domain derived from an 
extracellular matrix protein. 

12. A chimeric protein according to claim 11, wherein said extracellular matrix protein is selected from the group 
consisting of collagen, fibronectin, elastin, laminin and fibrin. 

13. A chimeric protein according to claim 11 or 12, wherein said cellular regulatory factor is selected from the group 
consisting of BMP, TGF-β, decorin and a decorin peptide. 

14. A method of manufacturing a chimeric cellular regulatory factor/extracellular matrix protein comprising: transforming 
a cell with the vector according to any one of claims 6 to 8; culturing said cell in a suitable culture medium; and 
obtaining said chimeric cellular regulatory factor/extracellular matrix protein from said culture medium. 

15. A pharmaceutical vehicle for delivery of a therapeutically active substance comprising a chimeric protein having at 
least two domains, wherein one of said domains is at least a portion of an extracellular matrix protein and another 
of said domains is at least a portion of a therapeutically active moiety and said domains are covalently linked. 

16. A pharmaceutical composition comprising a chimeric protein comprising a domain derived from a cellular regulatory 
factor and a domain derived from an extracellular matrix protein, and a pharmaceutically acceptable vehicle. 

17. A pharmaceutical composition according to claim 16, wherein said extracellular matrix protein is selected from the 
group consisting of collagen, fibronectin, elastin and fibrin. 

18. A pharmaceutical composition according to claim 16 or 17, wherein said cellular regulatory factor is selected from 
the group consisting of BMP, TGF-β, decorin and a decorin peptide. 

19. A pharmaceutical composition according to any one of claims 16 to 18, wherein said vehicle comprises a material 
selected from the group consisting of bioabsorbable polymers, biocompatible nonabsorbable polymers, lactoner putty 
and plaster of Paris. 

20. A pharmaceutical composition according to claim 19, wherein said material is selected from the group consisting 
of lactide, glycolide, trimethylene carbonate, dioxanone, caprolactone, polymethylmethacrylate and hydroxyethylmethacrylate. 

21. A method of preparing a DNA construct comprising: providing DNA which encodes a cellular regulatory factor or 
fragment thereof; providing DNA which encodes an extracellular matrix protein or fragment thereof; and operably linking said 
cellular regulatory factor or fragment thereof encoding DNA to said extracellular matrix protein or fragment thereof 
encoding DNA to form a chimeric DNA construct. 

22. Use of a chimeric protein according to any one of claims 11 to 13 or a pharmaceutical composition according to 
any one of claims 16 to 20 for the manufacture of a medicament for the prevention or treatment of disease. 

23. Use of a chimeric protein according to claim 13 or a pharmaceutical composition according to any one of claims 18 
to 20 for the manufacture of an osteogenic agent. 

24. Use of a chimeric protein according to claim 13 or a pharmaceutical composition according to any one of claims 18 
to 20, wherein said cellular regulatory factor is BMP, for the manufacture of a medicament for inducing bone and/or 
cartilage formation. 

25. Use of a chimeric protein according to claim 13 or a pharmaceutical composition according to any one of claims 18 
to 20, wherein said cellular regulatory factor is TGF-β, for the manufacture of a medicament for inducing soft tissue 
repair. 

26. Use of a chimeric protein according to claim 13 or a pharmaceutical composition according to any one of claims 18 
to 20, wherein said cellular regulatory factor is decorin or a decorin peptide, for the manufacture of a medicament 
for reducing scar formation. 

2. Claims 1-4, 6-23 partially and 25 completely: 
protein and a transforming growth factor-beta. 
Chimeric proteins containing an extracellular matrix 
protein and a decorin. 